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Description 

CHROMOSOME I GENE AND GENE PRODUCTS 
RELATED TO ALZHEIMER'S DISEASE 

5 

Technical Field 

The present invention relates generally to Alzheimer's disease, and more 
specifically to methods and compositions for use in diagnosis and treatment of 
Alzheimer's Disease. 

10 

Background of the Invention 

Alzheimer's disease (AD) is a devastating, neurodegenerative 
progressive disorder first recognized in 1907 (Alzheimer, Algemeine Zeitschrift fur 
Psychiatric 64: 146- 148, 1907). AD is a common disease in the elderly and is the 
1 5 predominant cause of dementia in people over 65 years of age. The prevalence of AD 
is estimated to he as high as 18.7% and 47.2% for the 75-84 year and > 85 year age 
groups respectively. Thus, there is a large affected population in most countries of the 
world. 

Clinical symptoms of the disease typically begin with subtle short term 

20 memory problems. As the disease progresses, difficulty with memory, language, and 
orientation worsen to the point of interfering with the ability of the person to function 
independently. Other symptoms, which are variable, include myoclonus and seizures. 
Duration of AD from the first symptoms of memory loss until death is 10 years on 
average, but may range from 6-8 years to more than 20 years. AD always results in 

25 death, often from respiratory-related illness. 

The pathology in AD is confined exclusively to the central nervous 
system (CNS). The predominant features arc amyloid deposits (plaques) and 
neurofibrillary tangles (NFT). In AD, amyloid is found associated with the vascular 
system of the CNS and as focal deposits in the parenchyma. The major molecular 

30 component of amyloid is a highly hydrophobic peptide called the AP peptide. This 
peptide aggregates into filaments in an anti-P-plcated sheet structure resulting in the 
birefringent nature of the AD amyloid. While Ap is the major component of AD 
amyloid, a partial list of other proteins associated with the amyloid includes a-l-anti- 
chymotrypsin (Abraham, et al.. Cell 52:487-501, 1988), cathepsin D (Cataldo, et al., 

35 Brain Res. 573:181-192. 1990), non-amyloid component protein (Ueda. et al., Proc. 
Natl Acad ScL USA 90:11282-11286, 1993), apo lipoprotein E (apoE) (Namba, et al.. 
Brain Res. 541: 163-166, 1991; Wisniewski and Frangione. NeuroscL Lett. 
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755:235-238, 1992; Strittmatter, et al., Proc. Nat. Acad. Sci USA 90:1977-1981, 1993), 
apolioprotein J (Choi-Mura, et a).. Acta Neuropathoi 55:260-264, 1992; McGeer, et al., 
Brain Res. 570:337-341, 1992), heat shock protein 70 (Hamos, ct a!.. Neurology 
47:345-350, 1991), complement components (McGeer and Rogers, Neurology 
5 45:447-449, 1992), a 2 _macroglobin (Strauss, et al.. Lab. Invest. 6(5:223-230, 1992), 
interleukin-6 (Strauss, et al., Lab. Invest. 66:223-230, 1992), proteoglycans (Snow, et 
al.. Lab. Invest. 5#:454-458, 1987), and scrum amyloid P (Coria, cl al., Lab. Invest. 
55:454-458, 1988). Surrounding many plaques are dystrophic neurites, which are nerve 
endings containing abnormal filamentous structures. Plaques are often surrounded by 

10 astrocytes and activated microglial cells expressing immune-related proteins, such as 
the MHC class 11 glycoproteins HLA-DR, HLA-DP, and HLA-DQ as well as MHC 
class 1 glycoproteins, interleukin-2 (IL-2) receptors, and IL-1. The other dominant 
feature of AD neuropathology is the presence of NFTs. These consist of abnormal 
filaments bundled together in neuronal cell bodies. "Ghost* NFTs are also observed in 

15 AD brains, which presumably mark the location of dead neurons. Other 
neuropathological features include granulovacuolar changes, neuronal loss, gliosis and 
the variable presence of Lewy bodies. 

In the AD brain, the destructive process of the disease is evident on a 
gross level. In the late-stage of AD, ventricular enlargement and shrinkage of the brain 

20 can be observed by magnetic resonance imaging. On autopsy, extensive gliosis and 
neuronal loss are observed. Thus, the amyloid plaque structures and NFTs observed at 
autopsy are most likely the end-points of a lengthy disease process, far removed from 
the initiating events of AD. Also, the cells remaining at autopsy are grossly different 
from those of a normal brain. Neurons, which were possibly involved in initiating 

25 events, arc absent and other cell types, such as the activated microglial cells and 
astroctyes, have gene expression patterns not observed in the normal brain. Thus, 
attempts using biochemical methods to identify key proteins and genes in the initialing 
steps of the disease are hampered by the fact that it is not possible to actually observe 
these critical initiating events. Rather, biochemical dissection of the AD brain at 

30 autopsy is akin to molecular archeology, attempting to reconstruct the pathogenic 
pathway by comparing the normal brain to the end-stage disease brain. 

Substantial evidence has suggested that inherited genetic defects are 
involved in AD. Numerous early-onset kindreds have been described (Bird, et al., Ann. 
NeuroL 25:25-31, 1988; Bird, et al., Ann. Neurol. 25:12-25, 1989; Cook, ct al., 

35 Neurology 29:1402-1412, 1979; Feldman, et aL, Neurology 75:811-824, 1960; 
Goudsmit, J. Neurol. ScL 49:79-. 1981; Heston and White, Behavior Genet. 8:3 15-331, 
1978; Martin, et al.. Neurology 47:62-68, 1991; Nee, et al., Arch. NeuroL 40:203-208, 
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1983; van BogaeerU et aL, Mschr. Psychait. Neurol. 702:249-301, 1940; Wheelan, Ann. 
Hum. Genet 25:300-309, 1959). (Early-onset is defined herein as onset prior to 65 
years.) Families with multiple late-onset AD cases have also been described (Bird, et 
ah, Ann. Neurol. 25:12-25, 1989; Heston and White, Behavior Genet. 5:315-331, 1978; 
5 Pericak-Vance, et aL, Exp. Neurol. 702:271-279, 1988). In addition, twin studies have 
documented that monozygotic twins are more concordant in their AD phenotype than 
dizygotic twins (Nee, et aL, Neurology 37:359-363, 1987; [133] ). Also, the families of 
concordant twins have more secondary cases of AD than families of discordant twins 
(Rapoport et aL, Neurology 41: 1 549- 1 553, 1 99 1 ). 

10 Genetic dissection of AD has been complicated by the complexity of the 

disease and the limited accuracy of its diagnosis. Because AD is common in the 
elderly, clustering of cases in a family may occur by chance, representing possible 
confounding non-allelic genetic heterogeneity, or etiologic heterogeneity with genetic 
and non-genetic cases co-existing in the same kindred. In addition, the clinical 

1 5 diagnosis of AD is confounded with other dementing diseases common in the elderly. 

Despite the problems associated with resolving complex genetic 
diseases, 2 causative AD loci and 1 risk-modifying gene have been identified. 
Mutations in the amyloid precursor protein (APP) gene on chromosome 21 cause early- 
onset (< 65 years) autosomal dominant AD (Goate, et aL. Nature 349:704, 1991). 

20 Mutations in a recently identified gene (AD3) on chromosome 14 also result in early- 
onset autosomal dominant AD (Schellenberg, et aL, Science 258:66%, 1992; 
Sherrington, et aL, Nature 575:754-760, 1995). For late-onset AD, the APOE gene has 
been identified as a genetic modifying factor (Strittmatter, et aL. Proc, Natl. Acad Sci. 
USA 00:1977, 1993; Corder, et aL. Science 267:921. 1993; Corder, et aL, Nat. Genet. . 

25 7:180-184, 1994; Benjamin, et aL, Lancet 344:473, 1994; Smith, ct aL, Lancet 
344:473-474. 1994). 

The known genetic loci for AD do not account for all cases of AD. For 
example, in late-onset AD approximately half of AD cases do not have the APOE e4 
allele (Brousseau, et aL, Neurology 342, 1994; Kuusisto. et aL. Brit. Med. J. 500:636. 

30 1994; Tsai, et aL, Am. J. Hum. Genet. 54: 643, 1994; Liddel, el aL, J. Med Genet. 
57.197. 1994). Also, in the Volga German (VG) kindreds (Cook, et aL, Neurology 
20:1402, 1979; Bird, et zl^Ann. Neurol. 25/25, 1988; Bird, et al. Ann. Neurol. 25:12. 
1989; Bird, Am. Hist. Soc. Germ. Russia J. 40:1991; Bird, et aL, in Heterogeneity of 
Alzheimer's Disease, F. Boiler, et aL, Eds. (Spring- Verlag, Heidelberg. 1992) pp. 

35 118-129), as in several other families with high incidence of AD, the known AD loci 
have been excluded as possible causes (Schellenberg, et aL, Science 255:668, 1992; 
Lannfelt, et aL, Nat. Genet. 4:218-219, 1993; van Duijn, et aL, Am. J. Hum. Genet. 
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55:714-727, 1994; Schellenberg, et al., Science 241:1507, 1988; Schellenberg, et aL, 
Am, J. Hum. Genet. 48:563, 1991 ; Schellenberg, et al.. Am. 1 Hum. Genet. 49:51 1-5 1 7^ 
1991; Kamino, et al., Am. J. Hum. Genet. 5/:998, 1992; Schellenberg, et al.. Am 1 
Hum. Genet. 55:619, 1993; [13]; Schellenberg, claLAnn. Neurol 31:223. 1992; Yu, et 
5 al.. Am. J. Hum Genet. 5*631, 1994). Identification of new genes should add 
considerably to the unfolding of the genetic determinants and enable biochemical and 
genetic approaches to diagnosis and treatment. 

The present invention provides a novel, previously unidentified locus for 
AD, methods and compositions for diagnosis and treatment of AD, and further provides 
1 0 other, related advantages. 

Summary of the Invention 

Briefly stated, the present invention provides isolated nucleic acid 
molecules encoding an AD4 (also known as STM2) gene product. Within one 
15 embodiment, a representative nucleic acid molecule is provided in Figures 1A and B 
(hereinafter referred to as Figure 1 ). Within other embodiments, nucleic acid molecules 
are provided which encode a mutant AD4 gene product that increases the probability of 
Alzheimer's Disease (in a statistically significant mariner). One representative 
illustration of such a mutant is an amino acid subsitution at residue 141, wherein, for 

20 example, an isoleucine may be substituted for an asparaginc. 

Within other aspects of the present invention, isolated nucleic acid 
molecules are provided, selected from the group consisting of (a) an isolated nucleic 
acid molecule as set forth in Figure 1 or complementary sequence thereof, (b) an 
isolated nucleic acid molecule that specifically hybridizes to the nucleic acid molecule 

25 of (a) under conditions of high stringency, and (c) an isolated nucleic acid that encodes 
an AD4 gene product. As utilized herein, it should be understood that a nucleic acid 
molecule hybridizes "specifically" to an AD4 gene (or related sequence) if it hybridizes 
detectably to such a sequence, but does not significantly or detectably hybridize to the 
AD3 gene sequence under the same conditions. Within other aspects, isolated DNA 

30 molecules are provided that contain genomic sequences, such as the sequences 
presented in Figures 13-19. 

Within other aspects, expression vectors are provided comprising a 
promoter operably linked to one of the nucleic acid molecule described above. 
Representative examples of suitable promoters include tissue-specific promoters, as 

35 well as promoters such as the CMV I-E promoter, SV40 early promoter and MuLV 
LTR. Within related aspects, viral vectors are provided that are capable of directing the 
expression of a nucleic acid molecule as described above. Representative examples of 
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such viral vectors include herpes simplex viral vectors, adenoviral vectors, adenovirus- 
associated viral vectors and retroviral vectors. Also provided are host cells (e.g.* 
human, dog, monkey, rat or mouse cells) which carry the above-described vectors. 

Within other aspects of the present invention, isolated proteins or 
5 polypeptides are provided comprising an AD4 gene product, as well as AD4 peptides of 
greater than 12, 13, or 20 amino acids. Within one embodiment, a protein is provided 
that has the amino acid sequence set forth in Figure 2. Within another embodiment, the 
protein is a mutant AD4 gene product that increases the probability of Alzheimer's 
Disease. Such mutants include those with an amino acid subsitution at residue 141 

10 (e.g., an aspargine to isoleucine substitution). Within yet a further embodiment, AD4 
peptides are provided which are composed of 13 to 20 amino acids derived or selected 
from the N-terminal, internal, or carboxy-terminal hydrophilic regions. 

Within yet another aspect of the present invention, methods of treating 
or preventing Alzheimer's Disease are provided, comprising the step of administering to 

15 a patient a vector containing or expressing a nucleic acid molecule as described above, 
thereby reducing the likelihood or delaying the onset of Alzheimer's Disease in the 
patient. Within a related aspect, methods of treating or preventing Alzheimer's Disease 
are provided, comprising the step of administering to a patient a protein as described 
above, thereby reducing the likelihood or delaying the onset of Alzheimer's Disease in 

20 the patient. Within yet another related aspect, methods are provided for treating or 
preventing Alzheimer's Disease comprising the step of administering to a patient an 
antibody specific for an AD4 protein as described above, thereby reducing the 
likelihood or delaying the onset of Alzheimer's Disease in the patient. Within certain 
embodiments, the above methods may be accomplished by in vivo administration. 

25 Also provided by the present invention are pharmaceutical compositions 

comprising a nucleic acid molecule, vector, host cell, protein, or antibody as described 
above, along with a pharmaceutically acceptable carrier or diluent. 

Within other aspects of the present invention, antibodies are provided 
which specifically bind to an AD4 protein, or to unique peptides derived from the N- 

30 terminal, internal, or carboxy-terminarhydrophilic regions. As utilized herein, it should 
be understood that an antibody is specific for an AD4 protein if it binds detectably, and 
with a Ka of 10" 7 M or less (e.g., 10~ 8 M, 10" 9 M, etc.), but does not bind detectably (or 
with an affinity of greater than 10" 7 M, (e.g., 10"^M, I0*" 5 M, etc.) to the AD3 protein. 
Also provided are hybridomas which are capable of producing such antibodies. 

35 Within other aspects of the present invention, nucleic acid probes are 

provided which are capable of specifically hybridizing (as defined above) to an AD4 
gene under conditions of high stringency. Within one related aspect, such probes 
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comprise at least a portion of the nucleotide sequence shown in Figure 1 or its 
complementary sequence, the probe being capable of specifically hybridizing to a 
mutant AD4 gene under conditions of high stringency. Within one particularly 
preferred aspect, probes are provided that are capable of specifically hybridizing to a 
5 mutant AD4 gene in which amino acid residue 141 is changed from asparagine to 
isoleucine, under conditions of very high stringency. Within other related aspects, 
probe are provided which are capable of specifically hybridizing to at least a portion of 
the nucleic sequence shown in any of Figures 13-19. Representative probes of the 
present invention are generally at least 12 nucleotide bases in length, although they may 

10 be 14, 16, 18 bases or longer. Also provided are primer pairs capable of specifically 
amplifying all or a portion of any of the nucleic acid molecules disclosed herein. 

Within other aspects of the invention, methods arc provided for 
diagnosing a patient having an increased likelihood of contracting Alzheimer's Disease, 
comprising the steps of (a) obtaining from a patient a biological sample containing 

15 nucleic acid, (b) incubating the nucleic acid with a probe which is capable of 
specifically hybridizing to a mutant ADA gene under conditions and for time sufficient 
to allow hybridization to occur, and (c) detecting the presence of hybridized probe, and 
thereby determining that said patient has an increased likelihood of contracting 
Alzheimer's Disease. Within another aspect, methods are provided comprising the 

20 steps of (a) obtaining from a patient a biological sample containing nucleic acid, (b) 
amplifying a selected nucleic acid sequence associated with a mutant AD4 gene, and (c) 
detecting the presence of an amplified nucleic acid sequence, and thereby determining 
that the patient has an increased likelihood of contracting Alzheimer's Disease. Within 
yet another aspect, methods are provided comprising the steps of (a) contacting a 

25 biological sample obtained from a patient with an antibody that specifically binds to a 
mutant AD4 protein, under conditions and for a time sufficient to allow binding of the 
antibody to the protein, and (b) detecting the presence of the bound antibody. Suitable 
biological samples include nucleated cells obtained from the peripheral blood, from 
buccal swabs, or brain tissue. 

30 Within another aspect, peptide vaccines are provied which comprise a 

portion of a mutant AD4 gene product containing a mutation, in combination with a 
pharmaceutically acceptable carrier or diluent. 

Within yet another aspect, transgenic animals are provided whose germ 
cells and somatic cells contain an AD4 gene which is operably linked to a promoter 

35 effective for the expression of the gene, the gene being introduced into the animal, or an 
ancestor of the animal, at an embryonic stage. Within one embodiment, the animal is a 
mouse, rat or dog. Within other embodiments, the AD4 gene is expressed from a vector 
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as described above. Within yet another embodiment, the AIM gene encodes a mutant 
AD4 gene product. 

These and other aspects of the present invention will become evident 
upon reference to the following detailed description and attached drawings. In addition, 
5 various references are set forth herein which describe in more detail certain procedures 
or compositions (e.g., plasmids, etc.), and are therefore incorporated by reference in 
their entirety. 

Brief Description of the Drawing 
1 0 Figures 1 A and 1 B depict the nucleotide sequence of an AIM gene. 

Figure 2 depicts the amino acid sequence of an AD4 gene product. 
Figure 3 depicts the sequence alignment of a representative AIM gene 
product to the AD3 gene product. Vertical bars between the sequences indicate 
sequence identity. Colons between the sequences indicate strong sequence similarity. 
15 Dots between the sequences indicate medium sequence similarity. Dots inserted into 
either sequence indicate gaps inserted into a sequence so as to maximize the score of 
the alignment. 

Figure 4 depicts a hydropathy plot of an AD4 gene product. The plot 
indicates that there are 7 putative transmembrane regions of this gene product, and 
20 indicates a mutation at residue 141 (asparaginic to isoleucinc). 

Figure 5 depicts certain sequence features of a representative AD4 gene 
product, including a conserved sequence of 4 amino acids (KPGL) in positions 202-205 
and 251-254, each of which lie at the C-terminus of a putative membrane spanning 
regions and are conserved in the human and mouse AD3 gene products. Figure 5 also 
25 depicts a putative glycosylation site (asparagine 366). 

Figure 6 is a dotplot which compares the human AD3 gene product and 
a representative AD4 gene product. The dark blocks indicate sequence similarity 
between the putative membrane spanning regions. 

Figure 7 depicts a family tree showing the segregation of chromosome 1 
30 markers in the R family. The bars beneath each subject represent minimum 
recombinant haplotypes constructed using the 23 markers from D1S238 to D1S235 
(Table 2). Alleles shown (from top to bottom) are for D1S249, D1S237, D1S479. 
D1S439 and D1S225, where A is the largest published allele. The solid bar represents 
the haplotypes segregating with the disease. Other haplotypes are represented by bars 
35 with different hatching or marking. A partially missing bar indicates markers not 
genotyped (subject IV- 15). A " M symbol represents regions where the phase cannot be 
assigned. APOE genotypes are shown directly above the haplotypes with 3 and 4 being 
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the e3 and e4 alleles, respectively. Recombinants were observed on the disease 
haplotypes between D1S479/D1S439 and D1S225 in subjects V-l and V-3, and 
between D1S306 and D1S310 in subject IV-18 (not shown). In IV-16 there is a 
recombination between D1S217 and D1S237. A total of 4 recombinations (3 on non- 
5 disease related chromosomes) were observed between D1S249 and D1S237 in the 27 
chromosomes in generation IV. The distance between these markers is 12 cM and the 
chance that up to 4 recombinants would be observed is 41%. 

Figure 8 depicts the location of chromosome 1 markers. Distances are 
given in centimorgans (sex-averaged) with the use of the Kosambi map function. The 3 
1 0 maps are not independent as all contain some common genotypes. 

Figure 9 sets forth the sequences of primers used to amplify 
polymorphic regions of chromosome I. 

Figure 10 presents the DNA sequence of clone 788iih5, EST T03796 
and clones Ell 1.1 and ELL 1.2. 
15 Figure 11 presents the DNA sequences of primers used in PCR or 

sequencing of the AIM gene sequence. 

Figure 12 presents an ethidium bromide stained gel of genomic DNA 
amplified with WWF and INTIR primers. Lanes are: 1 , at-risk, 79 years; 2, affected, 
onset 60 years; 3, at-risk, 67 years of age; 4, affected, onset 75 years (presumed 
20 phenocopy); 5, Boehringer Mannheim DNA size standard V; 6, affected, onset at 52 
years; 7, affected, onset 46 years; 8, affected, onset 49 years. 

Figure 1 3 presents a partial genomic DNA sequence of AD4. 

Figure 14 presents a partial genomic DNA sequence of AD4. 

Figure 15 presents a partial genomic DNA sequence of AD4. 
25 Figure 16 presents a partial genomic DNA sequence of AD4. 

Figure 17 presents a partial genomic DNA sequence of AD4. 

Figure 18 presents a partial genomic DNA sequence of AD4. 

Figure 19 presents a partial genomic DNA sequence of AD4. 

Figure 20 shows expression pattern of STM2 in different tissues. 

30 

Detailed Description of the Invention 
Definitions 

Prior to setting forth the invention in detail, it may be helpful to an 
35 understanding thereof to set forth definitions of certain terms and to list and to define 
the abbreviations that will be used hereinafter. 
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" Genetic marker " is any segment of a chromosome that is 
distinguishably unique in the genome, and polymorphic in the population so as to 
provide information about the inheritance of linked DNA sequences, genes and/or other 
markers. 

5 " Autosomal dominant " means that a trait is encoded on one of the non- 

sex chromosomes (autosomes) and is dominant for the phenotype it dictates for an 
individual having a heterozygous state. 

" LOP score " is a standard measure in genetics of the likelihood of a trait 
being localized in the interval being scored. Is the logarithm of a calculated probability. 
10 " Vector " refers to an assembly which is capable of directing the 

expression of an AD4 gene, as well as any additional sequence(s) or gene{s) of interest. 
The vector must include transcriptional promoter elements which are operably linked to 
the genes of interest. The vector may be composed of either deoxyribonucleic acids 
("DNA"), ribonucleic acids ("RNA"), or a combination of the two (e.g., a DNA-RNA 

15 chimeric). Optionally, the vector may include a polyadenylation sequence, one or more 
restriction sites, as well as one or more selectable markers such as neomycin 
phosphotransferase or hygromycin phosphotransferase. Additionally, depending on the 
host cell chosen and the vector employed, other genetic elements such as an origin of 
replication, additional nucleic acid restriction sites, enhancers, sequences conferring 

20 inducibility of transcription, and selectable markers, may also be incorporated into the 
vectors described herein. 

Abbreviations : AD, Alzheimer's disease; APP, amyloid precursor 
protein gene; APLP1 and APLP2, amyloid precursor like proteins: CNS, central 
nervous system; DS, Down syndrome; FAD, familial AD; HCHWA-D, hereditary 

25 cerebral hemorrhage with amyloidosis - Dutch type: (Z max ), maximum LOD score; 
NFTs, neurofibrillary tangles; STRP, short tandem repeat' polymorphism; 0, 
recombination fraction; VG, Volga German; AD3, the designation given to the 
chromosome 14 early-onset FAD gene; YAC, yeast artificial chromosome; EST, 
expressed sequence tag; PCR, polymerase chain reaction; RT-PCR, PCR process in 

30 which RNA is first transcribed into DNA at the first step using reverse transcriptase 
(RT); cDNA, any DNA made by copying an RNA sequence into DNA form. 

As noted above, the present invention provides methods and 
compositions for the detection and treatment of Alzheimer's Disease. These methods 
35 and compositions are based upon the discovery of a gene which is located on 
chromosome 1, which when mutated increases the probability of Alzheimer's Disease. 
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This gene, designated AD4, was found to be located on chromosome 1 at locus lq31- 
42. 

Although one embodiment of an AD4 gene is disclosed in Figure 1, it 
should be understood that the present invention is not so limited. In particular, within 
5 the context of the present invention reference to the AD4 gene should be understood to 
include derivatives, analogs, or allelic variants of the gene disclosed in Figure 1 that are 
substantially similar. As used herein, a nucleic acid molecule is deemed to be 
"substantially similar" if: (a) the nucleotide sequence is derived from the coding region 
of the described gene and includes portions of the sequence or allelic variations of the 
10 sequences discussed above; (b) the nucleotide sequence is capable of hybridization to 
nucleotide sequences of the present invention under high or very high stringency (see 
Sambrook et al., Molecular Cloning; A Laboratory Manual, 2nd ed.. Cold Spring 
Harbor Laboratory Press, NY. 1989); or (c) the DNA sequences are degenerate as a 
result of the genetic code to the DNA sequences defined in (a) or (b). In addition, the 
15 genomic sequence may contain only polymorphisms, either in the coding or non- 
translated region, or insertions, deletions and the like. Further, the AD4 gene includes 
both complementary and non-complementary sequences, provided the sequences 
otherwise meet the criteria set forth herein. Within the context of the present invention, 
high stringency means standard hybridization conditions (e.g., 5x SSPE, 0.5% SDS at 
65°C, or the equivalent), such that an appropriate nucleotide sequence is able to 
selectively hybridize to nucleotide sequences from the AD-related gene, and to mutant 
nucleotide sequences. Very high stringency means the nucleotide sequence is able to 
selectively hybridize to a single allele of the AD-related gene. 

The AD4 gene may be isolated from genomic DNA or cDNA. Genomic 
25 DNA libraries constructed in vectors, such as YACs (yeast artificial chromosomes), 
bacteriophage vectors, such as XEMBL3, XgtlO, cosmids or plasmids, are suitable for 
use. cDNA libraries constructed in bacteriophage vectors, plasmids, or others, are 
suitable for screening. Such libraries may be constructed using methods and techniques 
known in the art (see Sambrook et ah, Molecular Cloning: A Laboratory Manual, Cold 
30 Spring Harbor Press, 1989) or purchased from commercial sources (e.g., Clontech). 
Within one embodiment, the AD4 gene is isolated by PCR performed on genomic 
DNA, cDNA or libraries, or by probe hybridization of genomic DNA or cDNA 
libraries. Primers for PCR and probes for hybridization screening may be designed 
based on the DNA sequence of AD4 presented herein. The DNA sequence of an AD4 
35 gene is presented in Figure 1A and B; and the predicted amino acid sequence is 
presented in Figure 2. Primers for PCR should be derived from sequences in the 5' and 
3' untranslated region in order to isolate a full-length cDNA. The primers should not 
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have self-complementary sequences nor have complementary sequences at their 3' end 
(to prevent primer-dimer formation). Preferably, the primers have a GC content of 
about 50% and contain restriction sites. The primers are annealed to cDNA and 
sufficient cycles of PCR are performed to yield a product readily visualized by gel 
5 electrophoresis and staining. The amplified fragment is purified and inserted into a 
vector, such as A.gtlO or pBS(M13+), and propagated. An oligonucleotide 
hybridization probe suitable for screening genomic or cDNA libraries may be designed 
based on the sequence provided herein. Preferably, the oligonucleotide is 20-30 bases 
long. Such an oligonucleotide may be synthesized by automated synthesis. The 

10 oligonucleotide may be conveniently labeled at the 5' end with a reporter molecule, 
such as a radionuclide, (e.g., ^ 2 P) or biotin. The library is plated as colonies or phage, 
depending upon the vector, and the recombinant DNA is transferred to nylon or 
nitrocellulose membranes. Following denaturation. neutralization, and fixation of the 
DNA to the membrane, the membranes are hybridized with the labeled probe. The 

1 5 membranes are washed and the reporter molecule detected. The hybridizing colonies or 
phage are isolated and propagated. Candidate clones or PCR amplified fragments may 
be verified as containing AD4 DNA by any of various means. For example, the 
candidate clones may be hybridized with a second, nonoverlapping probe or subjected 
to DNA sequence analysis. In these ways, clones containing AD4 gene, which are 

20 suitable for use in the present invention are isolated. 

The structure of the proteins encoded by the nucleic acid molecules 
described herein may be predicted from the primary translation products using the 
hydrophobicity plot function of, for example, P/C Gene or Intelligenetics Suite 
(Intelligenetics, Mountain View, Calif.), or according to the methods described by Kyte 

25 and Doolittlc (J. MoL BioL 757:105-132, 1982). One such structure is depicted in 
Figure 4, which is a hydropathy plot of an AD4 gene product. As is evident from this 
figure, the plot depicts a protein with an extracellular domain, three extracellular loop 
domains, three intracellular loop domains, seven transmembrane regions, and an 
intracellular domain. 

30 AD4 proteins of the present invention may be in the form of acidic or 

basic salts, or in neutral form. In addition, individual amino acid residues may be 
modified by oxidation or reduction. Furthermore, various substitutions, deletions, or 
additions may be made to the amino acid or nucleic acid sequences, the net effect of 
which is to retain or further enhance or decrease the biological activity of the mutant or 

35 wild-type protein. Moreover, due to degeneracy in the genetic code, there may be 
considerable variation in nucleotide sequences encoding the same amino acid sequence. 
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Other derivatives of the AD4 proteins disclosed herein include 
conjugates of the proteins along with other proteins or polypeptides. This may be 
accomplished, for example, by the synthesis of N-terminal or C-terminal fusion 
proteins which may be added to facilitate purification or identification of AD4 proteins 
5 (see U.S. Patent No. 4,851,341, see also, Hopp ct ah, Bio/Technology tf:1204, 1988.) 
Alternatively, fusion proteins such as AD4-J*galactosidase or AD4-luciferase may be 
constructed in order to assist in the identification, expression, and analysis of AD4 
proteins. 

AD4 proteins of the present invention may be constructed using a wide 
10 variety of techniques including for example, that described in Example 5. Further, 
mutations may be introduced at particular loci by synthesizing oligonucleotides 
containing a mutant sequence, flanked by restriction sites enabling ligation to fragments 
of the native sequence. Following ligation, the resulting reconstructed sequence 
encodes a derivative having the desired amino acid insertion, substitution, or deletion. 

15 Alternatively, oligonucleotide-directed site-specific (or segment 

specific) mutagenesis procedures may be employed to provide an altered gene having 
particular codons altered according to the substitution, deletion, or insertion required. 
Exemplary methods of making the alterations set forth above are disclosed by Walder 
etal. (Gene 42:133, 1986); Bauer ctal. (Gene 37:73, 1985); Craik (BioTechniques. 

20 January 1985, 12-19); Smith etal. (Genetic Engineering: Principles and Methods. 
Plenum Press, 1981); and Sambrook et al. (supra). Deletion or truncation derivatives 
of AD4 proteins (e.g., a soluble extracellular portion) may also be constructed by 
utilizing convenient restriction endonuclease sites adjacent to the desired deletion. 
Subsequent to restriction, overhangs may be filled in, and the DNA relegated. 

25 Exemplary methods of making the alterations set forth above are disclosed by 
Sambrook et al. (Molecular Cloning: A Laboratory Manual 2d Ed., Cold Spring 
Harbor Laboratory Press, 1989). 

Mutations, which are made in the nucleic acid molecules of the present 
invention preferably preserve the reading frame of the coding sequences. Furthermore, 

30 the mutations will preferably not create complementary regions that could hybridize to 
produce secondary mRNA structures, such as loops or hairpins, that would adversely 
affect translation of the mRNA. Although a mutation site may be predetermined, it is 
not necessary that the nature of the mutation per se be predetermined. For example, in 
order to select for optimum characteristics of mutants at a given site, random 

35 mutagenesis may be conducted at the target codon and the expressed mutants screened 
for indicative biological activity. Alternatively, mutations may be introduced at 
particular loci by synthesizing oligonucleotides containing a mutant sequence, flanked 



BNSDOCID: <WO 97031 92A3_IA> 




WO 97/03192 PCT/US96/1 1386 

13 

by restriction sites enabling ligation to fragments of the native sequence. Following 
ligation, the resulting reconstructed sequence encodes a derivative having the desired 
amino acid insertion, substitution, or deletion. 

AD4 proteins may also be constructed utilizing techniques of PCR 
5 mutagenesis, chemical mutagenesis (Drinkwater and Klinedinst, PNAS #5:3402-3406, 
1986), by forced nucleotide misincorporation (e.g., Liao and Wise Gene A?:I07-I11, 
1990), or by use of randomly mutagenized oligonucleotides (Horwitz et aL, Genome 
5:1 12-1 17, 1989). Particularly preferred methods for constructing Alzheimer Disease 
Proteins are set forth in more detail below in the Examples. 

10 Proteins can be isolated by, among other methods, culturing suitable 

host and vector systems to produce the recombinant translation products of the present 
invention. Supernatants from such cell lines, or protein inclusions or whole cells where 
the protein is not excreted into the supernatant, can then be treated by a variety of 
purification procedures in order to isolate the desired proteins. For example, the 

15 supernatant may be first concentrated using commercially available protein 
concentration filters, such as an Amicon or Millipore Pellicon ultrafiltration unit. 
Following concentration, the concentrate may be applied to a suitable purification 
matrix such as, for example, an anti-protein antibody bound to a suitable support. 
Alternatively, anion or cation exchange resins may be employed in order to purify the 

20 protein. As a further alternative, one or more reverse-phase high performance liquid 
chromatography (RP-HPLC) steps may be employed to further purify the protein. 
Other methods of isolating the proteins of the present invention are well known in the 
skill of the art. 

A protein is deemed to be "isolated" within the contexi of the present 
25 invention if no other (undesired) protein is detected pursuant to SDS-PAGE analysis 
followed by Coomassie blue staining. Within other embodiments, the desired protein 
can be isolated such that no other (undesired) protein is delected pursuant to SDS- 
PAGE analysis followed by silver staining. 

The present invention also provides for the manipulation and expression 
30 of the above described genes by culturing host cells containing a vector capable of 
. expressing the above-described genes. Such vectors or vector constructs include either 
synthetic or cDNA-derived nucleic acid molecules encoding AD4 proteins, which are 
operably linked to suitable transcriptional or translational regulatory elements. Suitable 
regulatory elements may be derived from a variety of sources, including bacterial, 
35 fungal, viral, mammalian, insect, or plant genes. Selection of appropriate regulatory 
elements is dependent on the host cell chosen, and may be readily accomplished by one 
of ordinary skill in the art. Examples of regulatory elements include: a transcriptional 
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promoter and enhancer or RNA polymerase binding sequence, a transcriptional 
terminator, and a ribosomal binding sequence, including a translation initiation signal. 

Nucleic acid molecules that encode any of the AD4 proteins described 
above may be readily expressed by a wide variety of prokaryotic and cuicaryotic host 
5 cells, including bacterial, mammalian, yeast or other fungi, viral, insect, or plant cells. 
Methods for transforming or transfecting such cells to express foreign DNA are well 
known in the art (see, e.g.. Itakura et aL, U.S. Patent No. 4,704,362; Hinnen et aL, Proc. 
Natl. Acad. ScL USA 75:1929-1933, 1978; Murray etal., U.S. Patent No. 4,801.542; 
Upshall ct aL, U.S. Patent No. 4,935,349; Ilagcn ct aL, U.S. Patent No. 4,784,950; Axel 

10 etal., U.S. Patent No. 4,399,216; Goeddel etal., U.S. Patent No. 4,766,075; and 
Sambrook et aL Molecular Cloning: A Laboratory Manual 2nd edition, Cold Spring 
Harbor Laboratory Press, 1989; for plant cells see Czako and Marlon, Plant Physiol. 
I 0*1067-107 U 1994; and Paszkowski et aL, Biotech 2*387-392, 1992). 

Bacterial host cells suitable for carrying out the present invention 

!5 include E. coli, B. subtilis, Salmonella typhimurium, and various species within the 
genus 1 Pseudomonas* Streptomyces, and Staphylococcus, as well as many other 
bacterial species well known to one of ordinary skill in the art. Representative 
examples of bacterial host cells include DH5a (Stratagene, LaJolla, California). 

Bacterial expression vectors preferably comprise a promoter which 

20 functions in the host cell, one or more selectable phenotypic markers, and a bacterial 
origin of replication. Representative promoters include the p-lactamasc (penicillinase) 
and lactose promoter system (see Chang etal.. Nature 275:615, 1978), the T7 RNA 
polymerase promoter (Studier et aL, Metk Enzymol. 755:60-89, 1990), the lambda 
promoter (Elvin et at.. Gene 57:123-126, 1990), the trp promoter (Nichols and 

25 Yanofsky, Metk in Enzymology 101: 155, 1983) and the tac promoter (Russell etal.. 
Gene 20: 231, 1982). Representative selectable markers include various antibiotic 
resistance markers such as the kanamycin or ampicillin resistance genes. Many 
plasmids suitable for transforming host cells are well known in the art, including among 
others, pBR322 {see Bolivar etal.. Gene 2:95, 1977), the pUC plasmids pUCI8, 

30 pUC19, pUC118, pUC119 (see Messing, Meth. in Enzymology 707:20-77, 1983 and 
Vieira and Messing, Gene 7P:259-268, 1982), and pNH8A, pNH16a, pNH18a, and 
Bluescript Ml 3 (Stratagene, La Jolla, Calif.). 

Yeast and fungi host cells suitable for carrying out the present invention 
include, among others, Saccharomyces pombe, Saccharomyces cerevisiae, the genera 

35 Pichia or Kluyveromyces and various species of the genus Aspergillus (McKnight et aL, 
U.S. Patent No. 4,935,349). Suitable expression vectors for yeast and fungi include, 
among others, YCp50 (ATCC No. 37419) for yeast, and the amdS cloning vector pV3 
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(Turnbull, Bio/Technology 7:169, 1989), YRp7 (Struhl etai., Proc. Nati Acad Sci. 
USA 76:1035-1039, 1978), YEpl3 (Broach etai., Gene #121-133, 1979), pJDB249 
and pJDB219 (Beggs, Nature 275:104-108, 1978) and derivatives thereof. 

Preferred promoters for use in yeast include promoters from yeast 
5 glycolytic genes (Hitzcman etai., J. Biol Chem. 255:12073-12080. 1980; Alber and 
Kawasaki, J. MoL Appi Genet, 7:419-434, 1982) or alcohol dehydrogenase genes 
(Young et aL, in Genetic Engineering of Microorganisms for Chemicals, Hollaender 
et al. (eds.), p. 355, Plenum, New York, 1982; Ammerer, Meth. Rnzymol 707:192-201, 
1983). Examples of useful promoters for fungi vectors include those derived from 

10 Aspergillus nidulans glycolytic genes, such as the adh3 promoter (McFCnight etai.. 
EMBOJ. 4:2093-2099, 1985). The expression units may also include a transcriptional 
terminator. An example of a suitable terminator is the adh3 terminator (McKnight 
etaLiM/., 1985). 

As with bacterial vectors, the yeast vectors will generally include a 

15 selectable marker, which may be one of any number of genes that exhibit a dominant 
phenotype for which a phenotypic assay exists to enable transformants to be selected. 
Preferred selectable markers are those that complement host cell auxotrophy, provide 
antibiotic resistance or enable a cell to utilize specific carbon sources, and include leu2 
(Broach etai., ibid), ura3 (Botstein etai., Gene 5:17, 1979), or his3 (Struhl etai., 

20 ibid.). Another suitable selectable marker is the cat gene, which confers 
chloramphenicol resistance on yeast cells. 

Techniques for transforming fungi are well known in the literature, and 
have been described, for instance, by Beggs (ibid.). Hinnen et al. (Proc Natl. Acad. Sci. 
USA 75:1929-1933, 1978), Yelton etai. (Proc Natl. Acad Sci. USA #/: 1740-1747, 

25 1984), and Russell (Nature 507:167-169, 1983). The genotype of the host cell may 
contain contain a genetic defect that is complemented by the selectable marker present 
on the expression vector. Choice of a particular host and selectable marker is well 
within the level of ordinary skill in the art. 

Protocols for the transformation of yeast are also well known to those of 

30 ordinary skill in the art. For example, transformation may be readily accomplished 
either by preparation of spheroplasts of yeast with DNA (see Hinnen et al., PNAS USA 
75:1929. 1978) or by treatment with alkaline salts such as LiCl (see Itoh etaL J. 
Bacteriology 755:163, 1983). Transformation of fungi may also be carried out using 
polyethylene glycol as described by Cullen et al. (Bio/Technology 5:369, 1987). 

35 Viral vectors include those which comprise a promoter that directs the 

expression of an isolated nucleic acid molecule that encodes an Alzheimer Disease 
Protein as described above. A wide variety of promoters may be utilized within the 
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context of the present invention, including for example, promoters such as MoMLV 
LTR, RSV LTR, Friend MuLV LTR, Adenoviral promoter (Ohno etal., Science 265: 
781-784, 1994), neomycin phosphotransferase promoter/enhancer, late parvovirus 
promoter (Koering etaL, Hum. Gene Therap. 5:457-463, 1994), Herpes TK promoter, 
5 SV40 promoter, metallothionein Ha gene enhancer/promoter, cytomegalovirus 
immediate early promoter, and the cytomegalovirus immediate late promoter. Within 
particularly preferred embodiments of the invention, the promoter is a tissue-specific 
promoter (see e.g., WO 91/02805; EP 0,415,731; and WO 90/07936). Representative 
examples of suitable tissue specific promoters include neural specific enolase promoter, 

10 platelet derived growth factor beta promoter, bone morpho-genetic protein promoter, 
human alpha 1 -chimaerin, promoter, synapsin I promoter and synapsin II promoter. In 
addition to the above-noted promoters, other viral-specific promoters (e.g., retroviral 
promoters (including those noted above, as well as others such as HIV promoters), 
hepatitis, herpes (e.g., EBV), and bacterial, fungal or parasitic (e.g., malarial) -specific 

15 promoters may be utilized in order to target a specific cell or tissue which is infected 
with a virus, bacteria, fungus or parasite. 

Thus, AD4 proteins of the present invention may be expressed from a 
variety of viral vectors, including for example, herpes viral vectors (e.g., U.S. Patent 
No. 5,288,641), adenoviral vectors (e.g., WO 94/26914, WO 93/9191; Kolls et al., 

20 PNAS 9](l):2\5-2\9, 1994; Kass-Eisler et al., PNAS 90(24):! 1498-502, 1993; Guzman 
et al., Circulation W(6):2838-48, 1993; Guzman et al., Cir. Res. 73(6): 1202- 1207, 
1993; Zabner et al., Cell 75(2):207-216, 1993; Li et al., Hum Gene Ther. 4(4):403-409, 
1993; Caillaud et al., Eur. J. Neurosci. 5(10:1287-1291, 1993; Vincent et al., Nat. 
Genet. 5(2):130-134, 1993; Jaffe et al., Nat. Genet. /(5):372-378, 1992; and Levrero et 

25 al, Gene 101(2): 195-202, 1991), adeno-associated viral vectors (WO 95/13365; Flotte 
et al., PNAS 90(22): 1 06 13- 106 17, 1993), baculovirus vectors, parvovirus vectors 
(Koering et al., Hum. Gene Therap. 5:457-463, 1994), pox virus vectors (Panicali and 
Paoletti, PNAS 70:4927-4931, 1982; and Ozaki et al., Biochcm. Biophys. Res. Comm. 
795(2):653-660, 1993), and retroviruses (e.g., EP 0,415,731; WO 90/07936; WO 

30 91/0285, WO 94/03622; WO 93/25698; WO 93/25234; U.S. Patent No. 5.219,740; WO 
93/1 1230; WO 93/10218. Viral vectors may likewise be constructed which contain a 
mixture of different elements (e.g., promoters, envelope sequences and the like) from 
different viruses, or non-viral sources. Within various embodiments, either the viral 
vector itself, or a viral particle which contains the viral vector may be utilized in the 

35 methods and compositions described below. 

Mammalian cells suitable for carrying out the present invention include, 
among others:. PC12 (ATCC No. CRL1721), N1E-115 neuroblastoma. SK-N-BE(2)C 
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neuroblastoma, SHSY5 adrenergic neuroblastoma, NS20Y and NG108-15 murine 
cholinergic cell lines, or rat F2 dorsal root ganglion line, COS (e.g., ATCC No. CRL 
1650 or 1651), BHK (e.g., ATCC No. CRL 6281; BHK 570 cell line (deposited with 
the American Type Culture Collection under accession number CRL 10314), CHO 
5 (ATCC No. CCL 61), HeLa (e.g., ATCC No. CCL 2), 293 (ATCC No. 1573; Graham 
et ah, J. Gen. Virol. 5d:59-72, 1977) and NS-1 cells. Other mammalian cell lines may 
be used within the present invention, including Rat Hep 1 (ATCC No. CRL 1600), Rat 
Hep II (ATCC No. CRL 1548), TCMK (ATCC No. CCL 139), Human lung (ATCC 
No. CCL 75.1), Human hepatoma (ATCC No. IITB-52), Hep G2 (ATCC No. I IB 

10 8065), Mouse liver (ATCC No. CCL 29.1). NCTC 1469 (ATCC No. CCL 9.1), SP2/0- 
Agl4 (ATCC No. 1581), HIT-TIS (ATCC No. CRL 1777), and RINm 5AHT 2 B 
(Orskov and Nielson, F£AS'22P(1):175-178, 1988). 

Mammalian expression vectors for use in carrying out the present 
invention will include a promoter capable of directing the transcription of a cloned gene 

1 5 or cDNA. Preferred promoters include viral promoters and cellular promoters. Viral 
promoters include the cytomegalovirus immediate early promoter (Boshart et al.. Cell 
41:52 1-530, 1985), cytomegalovirus immediate late promoter, SV40 promoter 
(Subramani etal., Mol Cell Biol. 7:854-864, 1981), MMTV LTR, RSV LTR, 
metallothionein-1, adenovirus El a. Cellular promoters include the mouse 

20 metallothionein-1 promoter (Palmiter etal., U.S. Patent No. 4,579,821), a mouse V K 
promoter (Bergman et al., Proc. Nail Acad. ScL USA 57:7041-7045, 1983; Grant et al., 
Nucl. Acids Res. 75:5496, 1987) and a mouse Vjj promoter (Loh et aL Cell 53:85-93, 
1983). The choice of promoter will depend, at least in part, upon the level of 
expression desired or the recipient cell line to be transfected. 

25 Such expression vectors may also contain a set of RNA splice sites 

located downstream from the promoter and upstream from the DNA sequence encoding 
the peptide or protein of interest. Preferred RNA splice sites may be obtained from 
adenovirus and/or immunoglobulin genes. Also contained in the expression vectors is a 
polyadenylation signal located downstream of the coding sequence of interest. Suitable 

30 polyadenylation signals include the early or late polyadenylation signals from SV40 
(Kaufman and Sharp, ibid.), the polyadenylation signal from the Adenovirus 5 E1B 
region and the human growth hormone gene terminator (DeNoto et al., Nuc. Acids Res. 
9:3719-3730, 1981). The expression vectors may include a noncoding viral leader 
sequence, such as the Adenovirus 2 tripartite leader, located between the promoter and 

35 the RNA splice sites. Preferred vectors may also include enhancer sequences, such as 
the SV40 enhancer and the mouse Ig heavy chain enhancer (Gillies, Cell 55:717-728, 
1983). Expression vectors may also include sequences encoding the adenovirus VA 
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RN As. Suitable expression vectors can be obtained from commercial sources (e.g.. 
Stratagene, La Jolla, Calif.). 

Vector constructs comprising cloned DNA sequences can be introduced 
into cultured mammalian cells by, for example, calcium phosphate-mediated 
5 transfection (Wigler etal., Cell N:725 9 1978; Corsaro and Pearson, Somatic Cell 
Genetics 7:603, 1981; Graham and Van der Eb, Virology 52:456, 1973), electroporation 
(Neumann ctal., EMBO J. 7:841-845, 1982), or DEAE-dextran mediated transfection 
(Ausubel et aL (eds.), Current Protocols in Molecular Biology, John Wiley and Sons, 
Inc., NY, 1987). To identify cells that have stably integrated the cloned DNA. a 

10 selectable marker is generally introduced into the cells along with the gene or cDNA of 
interest. Preferred selectable markers for use in cultured mammalian cells include 
genes that confer resistance to drugs, such as neomycin, hygromycin, and methotrexate. 
The selectable marker may be an amplifiable selectable marker. Preferred amplifiable 
selectable markers are the DHFR gene and the neomycin resistance gene. Selectable 

1 5 markers are reviewed by Thilly (Mammalian Cell Technology. Butterworth Publishers, 
Stoneham, MA, which is incorporated herein by reference). 

Mammalian cells containing a suitable vector are allowed to grow for a 
period of time, typically 1-2 days, to begin expressing the DNA sequence(s) of interest. 
Drug selection is then applied to select for growth of cells that are expressing the 

20 selectable marker in a stable fashion. For cells that have been transfected with an 
amplifiable, selectable marker the drug concentration may be increased in a stepwise 
manner to select for increased copy number of the cloned sequences, thereby increasing 
expression levels. Cells expressing the introduced sequences are selected and screened 
for production of the protein of interest in the desired form or at the desired level. Cells 

25 that satisfy these criteria can then be cloned and scaled up for production. 

Protocols for the transfection of mammalian cells are well known to 
those of ordinary skill in the art. Representative methods include calcium phosphate 
mediated transfection. electroporation, lipofection, retroviral, adenoviral and protoplast 
fusion-mediated transfection (see Sambrook et a!., supra). 

30 Numerous insect host cells known in the art can also be useful within the 

present invention, in light of the subject specification. For example, the use of 
baculoviruses as vectors for expressing heterologous DNA sequences in insect cells has 
been reviewed by Atkinson et al. (Pestic. Sci. 2#:2 15-224. 1990). 

Numerous plant host cells known in the art can also be useful within the 

35 present invention, in light of the subject specification. For example, the use of 
Agrobacterium rhizogenes as vectors for expressing genes in plant cells has been 
reviewed by Sinkar et al., (J. Biosci (Bangalore) /7:47-58, 1987). 



BNSOOCIO. <WO 07031 92A3_IA> 




WO 97/03192 PCT/US96/11386 

19 

AD4 proteins may be prepared by growing (typically by culturing) the 
host/vector systems described above, in order to express the recombinant Alzheimer 
Disease Proteins. Recombinantly produced A EH proteins may be further purified as 
described in more detail below. 
5 Within related aspects of the present invention, AD4 proteins may be 

expressed in a transgenic animal whose germ cells and somatic cells contain an AD4 
gene which is operably linked to a promoter effective for the expression of the gene 
may also be expressed in non-human transgenic animals such as mice, rats, rabbits, 
sheep, dogs and pigs (sec Hammer etaL Nature 575:680-683, 1985, Palmiler ctal. 

10 Science 222:809-814, 1983, Brinster ctal. Proc. Natl. Acad. Set. USA #2:4438-4442. 
1985, Palmiter and Brinster Cell 47:343-345, 1985 and U.S. Patent Nos. 5,175,383, 
5,087,571, 4,736,866, 5,387,742, 5,347,075, 5,221,778, and 5,175,384). Briefly, an 
expression vector, including a nucleic acid molecule to be expressed together with 
appropriately positioned expression control sequences, is introduced into pronuclei of 

1 5 fertilized eggs, for example, by microinjection. Integration of the injected DNA is 
detected by blot analysis of DNA from tissue samples. It is preferred that the 
introduced DNA be incorporated into the germ line of the animal so that it is passed on 
to the animal's progeny. Tissue-specific expression may be achieved through the use of 
a tissue-specific promoter, or through the use of an inducible promoter, such as the 

20 metallothionein gene promoter (Palmiter eta!., 1983, ibid), which allows regulated 
expression of the transgene. 

Vectors of the present invention may contain or express a wide variety 
of additional nucleic acid molecules in place of or in addition to an AD4 protein as 
described above, either from one or several separate promoters. For example, the viral 

25 vector may express a lymphokine or lymphokine receptor, antisense or ribozyme 
sequence or toxins. Representative examples of lymphokines include IL-1, IL-2, IL-3. 
IL-4, IL-5, IL-6, 1L-7, IL-8, IL-9, IL-10, IL-1 1, IL-12, IL-13, IL-14, IL-15, GM-CSP, 
G-CSF, M-CSF, alpha-interferon, beta-interferon, gamma interferon, and tumor 
necrosis factors, as well as their respective receptors. Representative examples of 

30 antisense sequences include antisense sequences which block the expression of AD4 
protein mutants. Representative examples of toxins include: ricin, abrin, diphtheria 
toxin, cholera toxin, saporin, gelonin, pokeweed antiviral protein, tritin. Shigella toxin, 
and Pseudomonas exotoxin A. 

Within other aspects of the invention, antisense oligonucleotide 

35 molecules are provided which specifically inhibit expression of mutant AD4 nucleic 
acid sequences (see generally, Htrashima et al. in Molecular Biology of RNA: New 
Perspectives (M. Inouye and B. S. Dudock, eds., 1987 Academic Press, San Diego, p. 
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401); Oligonucleotides: Antisense Inhibitors of Gene Expression (J.S. Cohen, ed., 1989 
MacMillan Press, London); Stein and Cheng, Science 267:1004-1012 (1993); WO 
95/10607; U.S. 5,359,051; WO 92/0669.3; and EP-A2-6 12844). Briefly, such 
molecules are constructed such that they are complementary to, and able to form 
5 Watson-Crick base pairs with, a region of transcribed AD4 mutant mRNA sequence 
containing an AD4 mutation. The resultant double-stranded nucleic acid interferes with 
subsequent processing of the mRNA, thereby preventing protein synthesis. 

Within other related aspects of the invention, ribozyme molecules arc 
provided wherein an antisense oligonucleotide sequence is incorporated into a ribozyme 

10 which can specifically cleave mRNA molecules transcribed from a mutant AD4 gene 
(see generally. Kim et al. Proc. Nat. Acad Sci. USA £4:8788 (1987); HaselofT, et al. 
Nature 254:585 (1988), Cecil, JAMA 260:3030 (1988); Jeffries, et at. Nucleic Acids 
Res. 77:1371 (1989); U.S. 5,093,246; U.S. 5,354,855; U.S. 5,144,019; U.S. 5,272,262; 
U.S. 5,254,678; and U.S. 4,987,071). According to this aspect of the invention, the 

15 antisense sequence which is incorporated into a ribozyme includes a sequence 
complementary to, and able to form Watson-Crick base pairs with, a region of the 
transcribed mutant AD4 mRNA containing an AD4 mutation. The antisense sequence 
thus becomes a targeting agent for delivery of catalytic ribozyme activity specifically to 
mutant AD4 mRNA, where such catalytic activity cleaves the mRNA to render it 

20 incapable of being subsequently processed for AD4 protein translation. 



Host Cells 

25 As discussed above, nucleic acid molecules which encode the AD4 

proteins of the present invention (or the vectors which contain and/or express related 
mutants) may readily be introduced into a wide variety of host cells. Representative 
examples of such host cells include plant cells, eukaryotic cells, and prokaryotic cells. 
Within preferred embodiments, the nucleic acid molecules are introduced into cells 

30 from a vertebrate or warm-blooded animal, such as a human, macaque, dog, cow. horse, 
pig, sheep, rat, hamster, mouse or fish cell, or any hybrid thereof. 

Preferred prokaryotic host cells for use within the present invention 
include E. coli, Salmonella. Bacillus, Shigella. Pseudomonas. Streptomyces and other 
genera. Techniques for transforming these hosts and expressing foreign DNA 

35 sequences cloned therein are well known in the art (see, e.g.. Maniatis et al., Molecular 
Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, 1982, which is 
incorporated herein by reference; or Sambrook etal., supra). Vectors used for 



BNSDOCID: <WO 97031A2A3 JA> 



WO 97/03192 PCIYUS96/U386 

21 

expressing cloned DNA sequences in bacterial hosts will generally contain a selectable 
marker, such as a gene for antibiotic resistance, and a promoter that functions in the 
host cell. Appropriate promoters include the trp (Nichols and Yanofsky, Meth. 
Enzymol /0/M55-I64, 1983), lac (Casadaban eta!., J. Bacterial. 745:971-980, 1980), 
5 and phage X (Queen, ./. Mol Appl. Genet. 2:1-10, 1983) promoter systems. Plasmids 
useful for transforming bacteria include the pUC plasmids (Messing, Melh. Enzymol. 
/0/:20-78, 1983; Vieira and Messing, Gene 79:259-268, 1982), pBR322 (Bolivar et aL, 
Gene 2:95-1 13, 1977), pCQV2 (Queen, ibid\ and derivatives thereof. Plasmids may 
contain both viral and bacterial elements. 

10 Preferred eukaryotic cells include cultured mammalian cell lines (e.g.. 

rodent or human cell lines) and fungal cells, including species of yeast (e.g., 
Saccharomyces spp., particularly S. cerevisiae, Schizosaccharomyces spp., or 
Kluyveromyces spp.) or filamentous fungi (e.g., Aspergillus spp., Neurospora spp.). 
Strains of the yeast Saccharomyces cerevisiae are particularly preferred. Methods for 

15 producing recombinant proteins in a variety of prokaryotic and eukaryotic host cells are 
generally known in the art (see, "Gene Expression Technology," Methods in 
Enzymology, Vol. 185, Goeddel (ed.). Academic Press. San Diego, Calif., 1990; see 
also, "Guide to Yeast Genetics and Molecular Biology," Methods in Enzymology, 
Guthrie and Fink (cds.). Academic Press, San Diego, Calif., 1991). In general, a host 

20 cell will be selected on the basis of its ability to produce the protein of interest at a high 
level or its ability to carry out at least some of the processing steps necessary for the 
biological activity of the protein. In this way, the number of cloned DNA sequences 
that must be introduced into the host cell can be minimized and overall yield of 
biologically active protein can be maximized. 

25 The nucleic acid molecules (or vectors) may be introduced into host 

cells by a wide variety of mechanisms, including for example calcium phosphate- 
mediated transfection (Wigier et a!., Cell 14:125, 1978), lipofection; gene gun (Corsaro 
and Pearson, Somatic Cell Gen. 7:603, 1981; Graham and Van der Eb, Virology 52:456, 
1973), electroporation (Neumann et al., EMBO ./. 7:841-845, 1982), retroviral, 

30 adenoviral, protoplast fusion-mediated transfection or DEAE-dextran mediated 
transfection (Ausubel et aL, (eds.), Current Protocols in Molecular Biology. John 
Wiley and Sons, lnc M NY, NY, 1987). 

Host cells containing vector constructs of the present invention are then 
cultured to express a DNA molecule as described above. The cells are cultured 

35 according to standard methods in a culture medium containing nutrients required for 
growth of the chosen host cells. A variety of suitable media are known in the art and 
generally include a carbon source, a nitrogen source, essential amino acids, vitamins 
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and minerals, as well as other components, e.#, growth factors or serum, that may be 
required by the particular host cells. The growth medium will generally select for cells 
containing the DNA construct(s) by, for example, drug selection or deficiency in an 
essential nutrient which is complemented by the selectable marker on the DNA 
5 construct or co-transfected with the DNA construct. 

Suitable growth conditions for yeast cells, for example, include culturing 
in a chemically defined medium, comprising a nitrogen source, which may be a non- 
arnino acid nitrogen source or a yeast extract, inorganic salts, vitamins and essential 
amino acid supplements at a temperature between 4°C and 37°C, with 30°C being 

10 particularly preferred. The pH of the medium is preferably maintained at a pH greater 
than 2 and less than 8. more preferably pH 5-6. Methods for maintaining a stable pH 
include buffering and constant pH control. Preferred agents for pH control include 
sodium hydroxide. Preferred buffering agents include succinic acid and Bis-Tris 
(Sigma Chemical Co., St. Louis, Mo.)- Due to the tendency of yeast host cells to 

1 5 hyperglycosylate heterologous proteins, it may be preferable to express the nucleic acid 
molecules of the present invention in yeast cells having a defect in a gene required for 
asparagine-1 inked glycosylation. Such cells are preferably grown in a medium 
containing an osmotic stabilizer. A preferred osmotic stabilizer is sorbitol 
supplemented into the medium at a concentration between 0.1 M and 1.5 M, preferably 

20 at 0.5 M or 1.0 M. 

Cultured mammalian cells are generally cultured in commercially 
available serum-containing or serum-free media. Selection of a medium and growth 
conditions appropriate for the particular cell line used is well within the level of 
ordinary skill in the art. 

25 

Antibodies 

Antibodies to the AD4 proteins discussed above may readily be prepared 
given the disclosure provided herein. Such antibodies may, within certain 
embodiments, specifically recognize wild-type AD4 protein rather than mutant AD4 

30 protein, mutant AD4 protein rather than wild-type AD4 protein, or equally recognize 
both wild-type and mutant AD4 protein. Within the context of the present invention, 
antibodies are understood to include monoclonal antibodies, polyclonal antibodies, anti- 
idiotype antibodies, antibody fragments (e.g., Fab, and F(ab')2, F v variable regions, or 
complementarity determining regions). As discussed above, antibodies are understood 

35 to be specific against an AD4 protein if it binds with a K a of greater than or equal to 
10~?M, preferably greater than of equal to 10" 8 M. The affinity of a monoclonal 
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antibody or binding partner can be readily determined by one of ordinary skill in the art 
(see Scatchard,^w». N. Y. Acad. ScL 57:660-672, 1949). 

Briefly, polyclonal antibodies may be readily generated by one of 
ordinary skill in the art from a variety of warm-blooded animals such as horses, cows. 
5 various fowl, rabbits, mice, or rats. Typically, an AD4 protein or unique AD4 peptide 
of 13-20 amino acids (preferably conjugated to keyhole limpet hemocyanin by cross- 
linking with glutaraldehyde) is utilized to immunize the animal through intraperitoneal, 
intramuscular, intraocular, or subcutaneous injections, an adjuvant such as Freund's 
complete or incomplete adjuvant. Following several booster immunizations, samples 

10 of serum are collected and tested for reactivity to the AD4 protein. Particularly 
preferred polyclonal antisera will give a signal on one of these assays that is at least 
three times greater than background. Once the titer of the animal has reached a plateau 
in terms of its reactivity to the protein, larger quantities of antisera may be readily 
obtained either by weekly bleedings, or by exsanguinating the animal. 

1 5 Monoclonal antibodies may also be readily generated using conventional 

techniques (see U.S. Patent Nos. RE 32,011, 4,902,614, 4,543.439, and 4,411,993 
which are incorporated herein by reference; see also Monoclonal Antibodies, 
Hybridomas: A New Dimension in Biological Analyses, Plenum Press, Kennett, 
McKearn, and Bechtol (eds.), 1980, and Antibodies: A Laboratory Manual. Harlow and 

20 Lane (eds.). Cold Spring Harbor Laboratory Press, 1988. which are also incorporated 
herein by reference). 

Briefly, within one embodiment a subject animal such as a rat or mouse 
is injected with an AD4 protein or portion thereof as described above. The protein may 
be admixed with an adjuvant such as Freund's complete or incomplete adjuvant in order 

25 to increase the resultant immune response. Between one and three weeks after the 
initial immunization the animal may be reimmunized with another booster 
immunization, and tested for reactivity to the protein utilizing assays described above. 
Once the animal has reached a plateau in its reactivity to the injected protein, it is 
sacrificed, and organs which contain large numbers of B cells such as the spleen and 

30 lymph nodes are harvested. 

Cells which are obtained from the immunized animal may be 
immortalized by transfection with a virus such as the Epstein-Barr virus (EBV) (see 
Glasky and Reading, Hybridoma 8(4)-311-?>%9, 1989). Alternatively, within a preferred 
embodiment, the harvested spleen and/or lymph node cell suspensions are fused with a 

35 suitable myeloma cell in order to create a "hybridoma" which secretes monoclonal 
antibody. Suitable myeloma lines include, for example, NS-1 (ATCC No. TIB 18), and 
P3X63 - Ag 8.653 (ATCC No. CRL 1580). 



WO 97/03192 PCTOJS96/11386 

24 



Following the fusion, the cells may be placed into culture plates 
containing a suitable medium, such as RPMI 1640, or DMEM (Dulbecco's Modified 
Eagles Medium) (JRH Biosciences, Lenexa, Kansas), as well as additional ingredients, 
such as Fetal Bovine Serum (FBS, i.e., from Hyclone, Logan, Utah, or JRH 
5 Biosciences). Additionally, the medium should contain a reagent which selectively 
allows for the growth of fused spleen and myeloma cells such as HAT (hypoxanthine, 
aminopterin, and thymidine) (Sigma Chemical Co., St. Louis, Missouri). After about 
seven days, the resulting fused ceils or hybridomas may be screened in order to 
determine the presence of antibodies which arc reactive against an AD4 protein. A 

10 wide variety of assays may be utilized to determine the presence of antibodies which 
are reactive against the proteins of the present invention, including for example 
countercurrent Immunoelectrophoresis, radioimmunoassays, radioimmunoprecipita- 
tions, enzyme-linked immunosorbent assays (ELISA), dot blot assays. Western blots, 
immunoprecipitation, inhibition or competition assays, and sandwich assays (sec U.S. 

15 Patent Nos. 4,376,110 and 4,486,530; see also Antibodies: A Laboratory Manual. 
Harlow and Lane (eds.). Cold Spring Harbor Laboratory Press, 1988). Following 
several clonal dilutions and reassays, a hybridoma producing antibodies reactive against 
the AD4 protein may be isolated. 

Other techniques may also be utilized to construct monoclonal 

20 antibodies (see William D. Huse et al., "Generation of a Large Combinational Library 
of the Immunoglobulin Repertoire in Phage Lambda," Science 246:121 5-1281, 
December 1989; see also L. Sastry etal., "Cloning of the Immunological Repertoire in 
Escherichia coii for Generation of Monoclonal Catalytic Antibodies: Construction of a 
Heavy Chain Variable Region-Specific cDNA Library," Proc. Natl Acad. ScL USA 

25 ^6/5728-5732, August 1989; see also Michelle Alting-Mees etal., "Monoclonal 
Antibody Expression Libraries: A Rapid Alternative to Hybridomas," Strategies in 
Molecular Biology 3: 1 -9, January 1990; these references describe a commercial system 
available from Stratacyte, La Jolla, California, which enables the production of 
antibodies through recombinant techniques). Briefly, mRNA is isolated from a B cell 

30 population, and utilized to create heavy and light chain immunoglobulin cDNA 
expression libraries in the MmmunoZap(H) and MmmunoZap(L) vectors. These 
vectors may be screened individually or co-expressed to form Fab fragments or 
antibodies (see Huse et al., supra\ sec also Sastry et al., supra). Positive plaques may 
subsequently be converted to a non-lytic plasmid which allows high level expression of 

35 monoclonal antibody fragments from £ coll. 

Similarly, portions or fragments, such as Fab or Fv fragments, of 
antibodies may also be constructed utilizing conventional enzymatic digestion or 



3NSOOC3D: <WO 9703192A3_IA> 



WO 97/03192 



PCT/US96/11386 



25 

recombinant DNA techniques to incorporate the variable regions of a gene which 
encodes a specifically binding antibody. Within one embodiment, the genes which 
encode the variable region from a hybridoma producing a monoclonal antibody of 
interest are amplified using nucleotide primers for the variable region. These primers 
5 may be synthesized by one of ordinary skill in the art, or may be purchased from 
commercially available sources. Stratacyte (La Jolla, Calif.) sells primers for mouse 
and human variable regions including, among others, primers for V Ha , V HD , V HC , V Hc j, 
C H j, V L and C L regions. These primers may be utilized to amplify heavy or light chain 
variable regions, which may then be inserted imo vectors such as InimunoZAP™ H or 

10 ImmunoZAP™ L (Stratacyte), respectively. These vectors may then be introduced into 
E. coli, yeast, or mammalian -based systems for expression. Utilizing these techniques, 
large amounts of a single-chain protein containing a fusion of the V H and V L domains 
may be produced (sec Bird etaL, Science 242:423-426, 1988). In addition, such 
techniques may be utilized to change a "murine' 1 antibody to a "human" antibody, 

1 5 without altering the binding specificity of the antibody. 

Once suitable antibodies have been obtained, they may be isolated or 
purified by many techniques well known to those of ordinary skill in the art (see 
Antibodies: A Laboratory ManuaL Harlow and Lane (eds.). Cold Spring Harbor 
Laboratory Press, 1988). Suitable techniques include peptide or protein affinity 

20 columns, HPLC or RP-HPLC, purification on protein A or protein G columns, or any 
combination of these techniques. 

Antibodies of the present invention have many uses. For example, 
antibodies may be utilized in flow cytometry to sort cells bearing such an AD4 protein. 
Briefly, in order to detect the protein or peptide of interest on cells, the cells arc 

25 incubated with a labeled monoclonal antibody which specifically binds to the protein of 
interest, followed by detection of the presence of bound antibody. These steps may also 
be accomplished with additional steps such as washings to remove unbound antibody. 
Labels suitable for use within the present invention are well known in the art including, 
among others, flourescein isothiocyanate (FITC), phycoerythrin (PE), horse radish 

30 peroxidase (HRP), and colloidal gold. Particularly preferred for use in flow cytometry 
is FITC, which may be conjugated to purified antibody according to the method of 
Kellkamp in "Conjugation of Fluorescein Isothiocyanate to Antibodies. I. Experiments 
on the Conditions of Conjugation,** Immunology 75:865-873, 1970. (See also 
Keltkamp, "Conjugation of Fluorescein Isothiocyanate to Antibodies. II. A 

35 Reproducible Method," Immunology 75:875-881, 1970; Goding, "Conjugation of 
Antibodies with Fluorochromes: Modification to the Standard Methods," J. Immunol 
Methods 75:215-226.1970.). 
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Assays 

Assays useful within the context of the present invention include those 
assays for detecting agonists or antagonists of AD4 protein activity. Other assays are 
5 useful for the screening of peptide or organic molecule libraries. Still other assays are 
useful for the identification and/or isolation of nucleic acid molecules and/or peptides 
within the present invention, or for diagnosis of a patient with an increased likelihood 
of contracting Alzheimer's Disease. 

10 A. Nucleic Acid Based Diaenostic Tests 

Briefly, another aspect of the present invention provides probes and 
primers for detecting the AD4 genes and/or mutants thereof. In one embodiment of this 
aspect, probes are provided that are capable of specifically hybridizing to RNA or DNA 
from the AD4 gene. For purposes of the present invention, probes are "capable of 

15 hybridizing" to AD4 genes DNA or RNA if they hybridize to an AD4 gene under 
conditions of either high or moderate stringency (see Sambrook et aL, supra) but not 
significantly or detectably to the AD3 gene. Preferably, the probe may be utilized to 
hybridize to suitable nucleotide sequences under high stringency conditions, such as 5x 
SSPE, lx Denhardt's solution (Sambrook et dl., supra), 0.1% SDS at 65°C and at least 

20 one wash to remove excess probe in the presence of 0.2x SSC, 1 x Denhardt's solution, 
0.1% SDS at 65°C. Except as otherwise provided herein, probe sequences are designed 
to allow hybridization to AD4 genes, but not to DNA or RNA sequences from other 
genes, such as AD3. The probes are used, for example, to hybridize to nucleic acid that 
is present in a biological sample isolated from a patient. The hybridized probe is then 

25 detected, thereby indicating the presence of the desired cellular nucleic acid. 
Preferably, the cellular nucleic acid is subjected to an amplification procedure, such as 
PCR, prior to hybridization. Alternatively, the AD4 gene may be amplified and the 
amplified product subjected to DNA sequencing. Mutants of AD4 may be detected by 
DNA sequence analysis or hybridization with allele-specific oligonucleotide probes 

30 under conditions and for time sufficient to allow hybridization to the specific allele. 
Typically, the hybridization buffer and wash will contain tetramethyl ammonium 
chloride or the like (see Sambrook et aL, supra). 

Nucleic acid probes of the present invention may be composed of either 
deoxyribonucleic acids (DNA), ribonucleic acids (RNA), nucleic acid analogues (e.g.. 

35 peptide nucleic acids), or any combination thereof, and may be as few as about 12 
nucleotides in length, usually about 14 to 18 nucleotides in length, and possibly as large 
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as the entire sequence of an AD4 gene. Selection of probe size is somewhat dependent 
upon the use of the probe, and is within the skill of the art. 

Suitable probes can be constructed and labeled using techniques that are 
well known in the art. Shorter probes of. for example, 12 bases can be generated 

5 synthetically and labeled with using T4 polynucleotide kinase. Longer probes of 
about 75 bases to less than 1.5 kb are preferably generated by, for example, PCR 
amplification in the presence of labeled precursors such as [<x- 32 P]dCTP. digoxigenin- 
dUTP, or biotin-dATP. Probes of more than 1.5 kb are generally most easily amplified 
by transfecting a ceil with a plasmid containing the relevant probe, growing the 

10 transfected cell into large quantities, and purifying the relevant sequence from the 
transfected cells. (See Sambrook et aL supra.) 

Probes can be labeled by a variety of markers, including for example, 
radioactive markers, fluorescent markers, enzymatic markers, and chromogenic 
markers. The use of 32 P is particularly preferred for marking or labeling a particular 

15 probe. 

It is a feature of this aspect of the invention that the probes can be 
utilized to detect the presence of AD4 mRNA or DNA within a sample. However, if 
the nucleic acid is present in only a limited number, then it may be beneficial to amplify 
the relevant sequence so that it may be more readily detected or obtained. 

20 A variety of methods may be utilized in order to amplify a selected 

sequence, including, for example, RNA amplification (see Lizardi el aL, 
Bio/Technology 6:\ 197-1202, 1988; Kramer et aL, Nature 33P:401-402, 1989; Lomcli 
etaL, Clinical Chcm. 3.5(9): 1826- 183 1, 1989; U.S. Patent No. 4,786,600), and DNA 
amplification utilizing LCR or Polymerase Chain Reaction ("PCR") (see. U.S. Patent 

25 Nos. 4.683,195, 4,683,202, and 4,800,159) (see also U.S. Patent Nos. 4,876,187 and 
5,01 1,769, which describe an alternative detection/amplification system comprising the 
use of scissile linkages), or other nucleic acid amplification procedures that are well 
within the level of ordinary skill in the art. With respect to PCR, for example, the 
method may be modified as known in the art. Transcriptional enhancement of PCR 

30 may be accomplished by incorporation of bacteriophage T7 RNA polymerase promoter 
sequences in one of the primary oligonucleotides, and immunoenzymatic detection of 
the products from the enhanced emitter may be effected using anti-RNA;DNA 
antibodies (Blais, Appl Environ. Microbiol. 60:348-352, 1994). PCR may also be used 
in combination with reverse dot-blot hybridization (Iida et aL, FEMS Microbiol Lett. 

35 7/4:167-172, 1993). PCR products may be quantitatively analyzed by incorporation of 
dUTP (DuplaaetaL, Anal Biochem. 2/2:229-236, 1993), and samples may be filter 
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sampled for PCR-gene probe detection (Bej et aL, Appl Environ. Microbiol. 
57:3529-3534, 1991). 

Within a particularly preferred embodiment, PCR amplification is 
utilized to detect AD4 DNA. Briefly, as described in greater detail below, a DNA 
5 sample is denatured at 95°C in order to generate single-stranded DNA. Specific 
primers are then annealed to the single-stranded DNA at 37°C to 70°C depending on 
the proportion of AT/GC in the primers. The primers are extended at 72°C with Tag 
DNA polymerase in order to generate the opposite strand to the template. These steps 
constitute one cycle, which may be repeated in order to amplify the selected sequence. 

10 Within an alternative preferred embodiment, LCR amplification is 

utilized for amplification. LCR primers are synthesized such that the 5' base of the 
upstream primer is capable of hybridizing to a unique base pair in a desired gene to 
specifically detect an AD4 gene. 

Within another preferred embodiment, the probes arc used in an 

1 5 automated, non-isotopic strategy wherein. target nucleic acid sequences arc amplified by 
PCR. and then desired products are determined by a coiorimetric oligonucleotide 
ligation assay (OLA) (Nickerson et al., Proc. Natl. Acad ScL USA 57:8923-8927, 
1990). 

Primers for the amplification of a selected sequence should be selected 

20 from sequences that are highly specific to AD4 (an not AD3) and form stable duplexes 
with the target sequence. The primers should also be non-complementary, especially at 
the 3' end, should not form dimers with themselves or other primers, and should not 
form secondary structures or duplexes with other regions of DNA. In general, primers 
of about 18 to 20 nucleotides are preferred, and can be easily synthesized using 

25 techniques well known in the art. PCR products, and other nucleic acid amplification 
products, may be quantitated using techniques known in the art (Duplaa et al., Anal. 
Biochem. 2/2:229-236, 1993; Higuchi et al., Bio/Technology 77:1026-1030). 

Within a particularly preferred embodiment, restriction enzyme analysis 
is used to detect a mutant AD4 gene. For example, within One embodiment the N 1 4 1 1 

30 mutation in AD4 creates a Sav&A I restriction site. cDNA, genomic DNA, or amplified 
AD4 cDNA or genomic DNA is restricted with Sau3A I. If the AD4 gene contains the 
N141I alteration, the Sau3A I fragment or amplified product containing codon 141 will 
be cleaved into two smaller fragments. The cleavage is readily detected by 
electrophoresis in agarose or acrylamide gels and visualized by ethidium bromide 

35 fluorescence or probe hybridization using AD4 gene fragments or oligonucleotides. 
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B. Antibody-based Diagnostic Tests 

Still another aspect of the present invention provides antibodies, as 
discussed above, for the detection of AD4 gene products in diagnostic tests. A variety 
of assays can be utilized in order to detect antibodies that specifically bind to the 
5 desired protein or peptide. Exemplary assays are described in detail in Antibodies: A 
laboratory Manual. Harlow and Lane (eds.). Cold Spring Harbor Laboratory Press, 
1988. Representative examples of such assays include: countercurrent immuno- 
electrophoresis (CIEP), radioimmunoassays, radioimmunoprccipitations, enzyme- 
linked immuno-sorbenl assays (ELISA), dot blol assays, inhibition or competition 

10 assays, and sandwich assays, immunostick (dipstick) assays, simultaneous 
immunoassays, immunochromatographic assays, immunoflltration assays, latex bead 
agglutination assays, immunofluorescent assays, biosensor assays, and low-light 
detection assays (see U.S. Patent Nos. 4,376,1 10 and 4,486.530; see also Antibodies: A 
Laboratory Manual, supra). 

15 A fluorescent antibody test (FA-test) uses a fluorescently-labelcd 

antibody able to bind to one of the proteins of the invention. For detection, visual 
determinations are made by a technician using fluorescence microscopy, yielding a 
qualitative result. In a preferred embodiment, this assay is used for the examination of 
tissue samples and histological sections. 

20 In latex bead agglutination assays, antibodies to one or more of the 

proteins of the present invention are conjugated to latex beads. The antibodies 
conjugated to the latex beads are then contacted with a sample under conditions 
permitting antibodies to bind to desired proteins in the sample, if any. The results are 
then read visually, yielding a qualitative result. In a preferred embodiment, this format 

25 can be used in the field for on-site testing. 

Enzyme immunoassays (EIA) include a number of different assays able 
to utilize the antibodies provided by the present invention. For example, a 
heterogeneous indirect EIA uses a solid phase coupled with an antibody of the 
invention and an affinity purified, anti-IgG immunoglobulin preparation. Preferably, 

30 the solid phase is a polystyrene microtiter plate. The antibodies and immunoglobulin 
preparation are then contacted with the sample under conditions permitting antibody 
binding, which conditions are well known in the art. The results of such an assay can 
be read visually, but are preferably read using a spectrophotometer, such as an ELISA 
plate reader, to yield a quantitative result. 

35 An alternative solid phase EIA format includes a plastic-coated ferrous 

metal beads able to be moved during the procedures of the assay by means of a magnet. 
Yet another alternative is a low-light detection immunoassay format. In this highly 
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sensitive formal, the light emission produced by appropriately labeled bound antibodies 
are quanlitated automatically. Preferably, the reaction is performed using microtiter 
plates. 

In a capture-antibody sandwich enzyme assay, the desired protein is 
5 bound between an antibody attached to a solid phase, preferably a polystyrene 
microtiter plate, and a labeled antibody. Preferably, the results arc measured using a 
spectrophotometer, such as an ELISA plate reader. 

In an alternative embodiment, a radioactive tracer is substituted for the 
enzyme mediated detection in an EIA to produce a radioimmunoassay (RIA). 

W In a sequential assay format, reagents are allowed to incubate with the 

capture antibody in a step wise fashion. The test sample is first incubated with the 
capture antibody. Following a wash step, an incubation with the labeled antibody 
occurs. In a simultaneous assay, the two incubation periods described in the sequential 
assay are combined. This eliminates one incubation period plus a wash step. 

15 A dipstick/immunostick format is essentially an immunoassay except 

that the solid phase, instead of being a polystyrene microtiter plate, is a polystyrene 
paddle or dipstick. Reagents are the same and the format can either be simultaneous or 
sequential. 

In a chromatographic strip test format, a capture antibody and a labeled 

20 antibody are dried onto a chromatographic strip, which is typically nitrocellulose or 
nylon of high porosity bonded to cellulose acetate. The capture antibody is usually 
spray dried as a line at one end of the strip. At this end there is an absorbent material 
that is in contact with the strip. At the other end of the strip the labeled antibody is 
deposited in a manner that prevents it from being absorbed into the membrane. 

25 Usually, the label attached to the antibody is a latex bead or colloidal gold. The assay 
may be initiated by applying the sample immediately in front of the labeled antibody. 

Immunofiltration/immunoconcentration formats combine a large solid 
phase surface with directional flow of sample/reagents, which concentrates and 
accelerates the binding of antigen to antibody. In a preferred format, the test sample is 

30 preincubated with a labeled antibody then applied to a solid phase such as fiber filters 
or nitrocellulose membranes or the like. The solid phase can also be precoated with 
latex or glass beads coated with capture antibody. Detection of analyte is the same as 
standard immunoassay. The flow of sample/reagents can be modulated by either 
vacuum or the wicking action of an underlying absorbent material. 

35 A threshold biosensor assay is a sensitive, instrumented assay amenable 

to screening large number of samples at low cost. In one embodiment, such an assay 
comprises the use of light addressable potentiometric sensors wherein the reaction 
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involves the detection of a pH change due to binding of the desired protein by capture 
antibodies, bridging antibodies and urease-conjugated antibodies. Upon binding, a pH 
change is effected that is measurable by translation into electrical potential (u.vo!ts). 
The assay typically occurs in a very small reaction volume, and is very sensitive. 
5 Moreover, the reported detection limit of the assay is 1,000 molecules of urease per 
minute. 

C. Other Assays 

Transmembrane receptors are involved in many cellular communication 

10 process and have been the targets of numerous pharmacologic screening assays for the 
identification and development of new therapeutic agents. Many of these screening 
assays look for ligand induced changes in cell lines expressing the recombinant 
receptor. In some cases second messengers are assayed directly while in others, 
receptor is transfected into a cell line carrying a reporter gene construct whose 

15 expression level can be influenced (positively or negatively) by functional activation of 
the receptor. One common result of the stimulation of many different second 
messenger systems is transient changes in intracellular calcium homeostasis, this can 
be the result of Ca++ release from various intracellular compartments or from the influx 
of extracellular calcium. 

20 Calcium transients offer a highly sensitive and selective method for 

characterization of AD4 gene function. Expression of recombinant AD4 in cell lines 
previously transfected with an aequorin reporter construct can be used to screen for and 
identify an AD4 ligand. Aequorin is a 21 kDa photoprotein that upon Ca-» + binding 
undergoes an irreversible reaction with the production of light in the visible range. 

25 Because the fractional rate of aequorin consumption is proportional in the physiological 
[Ca++], it has been used for many years as a sensitive indicator of intracellular calcium. 
More recently, several different aequorin cDNA's have been engineered which allow 
selective targeting of aequorin expression to different intracellular compartments, 
including the cytoplasm, the nucleus and the endoplasmic reticulum. This allows for a 

30 variety of second messenger coupled pathways/compartments to be screened. 

Threefore, within one embodiment a cell line which expresses mutant 
AD4 may be utilized in order to identify compounds which modify the mutant protein 
function in a way that mimics wild-type AD4 activity. 

35 Labels 

AD4 proteins, nucleic acid molecules which encodes such proteins, anti- 
AD4 protein antibodies and agonists or antagonists, as described above and below, may 
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be labeled with a variety of molecules, including for example, fluorescent molecules, 
toxins, and radionuclides. Representative examples of fluorescent molecules include 
fluorescein, Phycobili. proteins, such as phycoerythrin, hodamine, Texas red and 
luciferase. Representative examples of toxins include ricin, abrin diphtheria toxin, 
5 cholera toxin, gelonin, pokeweed antiviral protein, tritin, Shigella toxin, and 
Pseudomonas exotoxin A. Representative examples of radionuclides include Cu-64, 
Ga-67, Ga-68, Zr-89, Ru-97, Tc-99m, Rh-105, Pd-I09, In-ll 1, M23, 1-125, 1-I31, Re- 
186, Re-188, Au-198, Au-199, Pb-203, At-211, Pb-212 and Bi-212. In addition, the 
antibodies described above may also be labeled or conjugated to one partner of a ligand 

10 binding pair. Representative examples include avidin-biotin, and riboflavin-riboflavin 
binding protein- 
Methods for conjugating or labeling the AD4 proteins, nucleic acid 
molecules which encodes such proteins, anti-AD4 protein antibodies and agonists or 
antagonists, as discussed above, with the representative labels set forth above may be 

15 readily accomplished by one of ordinary skill in the art (see Trichothecene Antibody 
Conjugate, U.S. Patent No. 4,744,981,; Antibody Conjugate, U.S. Patent No. 
5,106,951; Fluorogenic Materials and Labeling Techniques, U.S. Patent No. 4,018,884; 
Metal Radionuclide labeled Proteins for Diagnosis and Therapy, U.S. Patent No. 
4,897,255; and Metal Radionuclide Chelating Compounds for Improved Chelation 

20 Kinetics, U.S. Patent No. 4,988,496; see also Inman, Methods In Enzymology, Vol. 34, 
Affinity Techniques. Enzyme Purification: Part B, Jakoby and Wilchek (eds.). 
Academic Press, New York, p. 30, 1974; see also Wilchek and Bayer, "The Avidin- 
Biotin Complex in Bioanalytical Applications," Anal. Biochem. 17] A -32, 1 988). 

25 Pharmaceutical Compositions 

As noted above, the present invention also provides a variety of 
pharmaceutical compositions, comprising one of the above-described AD4 proteins, 
nucleic acid molecules, vectors, antibodies, host cells, agonists or antagonists, along 
with a pharmaceutically or physiologically acceptable carrier, excipients or diluents. 

30 Generally, such carriers should be nontoxic to recipients al the dosages and 
concentrations employed. Ordinarily, the preparation of such compositions entails 
combining the therapeutic agent with buffers, antioxidants such as ascorbic acid, low 
molecular weight (less than about 10 residues) polypeptides, proteins, amino acids, 
carbohydrates including glucose, sucrose or dextrins, chelating agents such as EDTA, 

35 glutathione and other stabilizers and excipients. Neutral buffered saline or saline mixed 
with nonspecific serum albumin are exemplary appropriate diluents. 
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In addition, the pharmaceutical compositions of the present invention 
may be prepared for administration by a variety of different routes, although 
intracranial routes are typically preferred. In addition, pharmaceutical compositions of 
the present invention may be placed within containers, along with packaging material 
5 which provides instructions regarding the use of such pharmaceutical compositions. 
Generally, such instructions will include a tangible expression describing the reagent 
concentration, as well as within certain embodiments, relative amounts of excipient 
ingredients or diluents (e.g., water, saline or PBS) which may be necessary to 
reconstitute the pharmaceutical composition. 

10 

Methods of Treating or Preventing Alzheimer Disease 
The present invention also provides methods for treating or preventing 
Alzheimer Disease, comprising the step of administering to a patient a vector (e.g.. 
expression vector, viral vector, or viral particle containing a vector) or nucleic acid 
15 molecules alone, as described above, thereby reducing the likelihood or delaying the 
onset of Alzheimer's Disease. 

Similarly, therapeutic peptides, peptidomimetics, or small molecules 
may be used to delay onset of Alzheimer's disease, lessen symptoms, or halt or delay 
progression of the disease. Such therapeutics may be tested in a transgenic animal 
20 model that expresses mutant protein, wild-type and mutant protein, or in an in vitro 
assay system. 

One such in vitro assay system measures the amount of amyloid protein 
produced. Briefly, by way of illustration, a cell expressing both AD4 gene product and 
amyloid is cultured in the presence of a candidate therapeutic molecule. The AD4 

25 protein expressed by the cell may be either wild-type or mutant protein. In either case, 
the amount of amyloid protein that is produced is measured from cells incubated with 
or without (control) the candidate therapeutic. Briefly, by way of example, cells are 
labeled in medium containing 35 S-methtonine and incubated in the presence (or 
absence) of candidate therapeutic. Amyloid protein is detected in the culture 

30 supernatent by immunoprecipitation and SDS-PAGE electrophoresis or by ELISA. A 
statistically significant reduction of amyloid protein compared to the control signifies a 
therapeutic suitable for use in preventing or treating Alzheimer's disease. 

Alternatively, transgenic animals expressing Alzheimer's disease protein 
may be used to test candidate therapeutics. Amyloid protein is measured or, if the 

35 animals exhibit other disease symptoms, such as memory or learning deprivation, an 
increase in memory or learning is measured. Memory and learning are tested in rodents 
by the Morris water maze (Stewart and Morris in Behavioral Neuroscience. R. Saghal, 
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Ed. (IRLPress, 1993, p. 107) and the Y-maze (Brits et al., Brain Res. Bull. 6:71, 1981). 
Therapeutics are administered to animals prior to testing. The response time in trials 
are measured and an improvement in memory and learning is demonstrated by a 
statistically significant decrease in the timed trials. 
5 As noled above, the present invention provides methods for treating or 

preventing Alzheimer's disease through the administration to a patient of a 
therapeutically effective amount of an antagonist or pharmaceutical composition as 
described herein. Such patients may be identified through clinical diagnosis based on 
symptoms of dementia or learning and memory loss which are not attributable to other 

10 causes. In addition, patients are also identified through diagnosis of brain atrophy as 
determined by magnetic resonance imaging. 

Cognitive behavior in AD may be measured by any one of several tests 
(See Gcrshon et al M Clinical Evaluation of Psychotropic Drugs: Principles and 
Guidelines. Prien and Robinson (eds.). Raven Press, Ltd., New York, 1994, p. 467). 

15 One such test, BCRS, is designed to measure only cognitive functions: concentration, 
recent memory, past memory, orientation, functioning, and self-care. This test, as well 
as the Weschier Memory Scale and the Alzheimer's Disease-Associated Scale, may be 
used to determine improvement following therapeutic treatment. "Improvement" in 
Alzheimer's disease is present if there is a statistically significant difference in the 

20 direction of normality in the Weschier Memory Scale test. For example, test results of 
the performance of treated patients as are compared to members of the placebo group or 
between subsequent tests given to the same patient. Improvement within the present 
invention also encompasses a delay in the age of onset of Alzheimer's disease. 

As will be evident to one of skill in the art, the amount and frequency of 

25 administration will depend, of course, on such factors as the nature and severity of the 
indication being treated, the desired response, the condition of the patient, and so forth. 
Typically, the compositions may be administered by a variety of techniques, although 
intra-cronial routes are often preferred. 

Within other embodiments of the invention, the. vectors which contain or 

.30 express the nucleic acid molecules which encode the AD4 proteins described above, or 
even the nucleic acid molecules themselves may be administered by a variety of 
alternative techniques, including for example administration of asialoosomucoid 
(ASOR) conjugated with poly (L-lysine) DNA complexes (Cristano et al., PNAS 
92122-92126, 1993), DNA linked to killed adenovirus (Curiel et al., Hum. Gene Ther. 

35 5(2):147-154, 1992), cytofectin-mediated introduction (DMRIE-DOPE, Vical, Calif), 
direct DNA injection (Acsadi et al.. Nature 352:815-818, 1991); DNA ligand (Wu et 
al., J. ofBiol. Chem. 26V: 1 6985- 1 6987, 1989); lipofection (Feigner et al., Proc. Natl. 
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Acad. ScL USA #4:7413-74 17, 1989); liposomes (Pickering et aL Circ. £0(l):13-21, 
1994; and Wang et aL PNAS W;7851-7855, 1987); microprqjectile bombardmenl 
(Williams et al., PNAS ^5:2726-2730, 1991); and direct delivery of nucleic acids which 
encode the AD4 protein itself either alone (Vile and Hart, Cancer Res. 53: 3860-3864, 
5 1993), or utilizing PEG-nucIeic acid complexes. 

The following examples are offered by way of illustration, and not by 
way oflimitation. 
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EXAMPLES 
EXAMPLE 1 

Genetic Mapping of the AD locus fro m the Volga German Kindred 

5 

The Volga German kindreds are a group of 7 related families with 
autosomal dominant early-onset Alzheimer's disease (AD). These families come from a 
group of Germans, who, in the 1760's, emmigrated to the Volga river region of Russia. 
The VG remained culturally distinct from the surrounding Russian population and has 

10 apparently been genetically relatively isolated until this century. In 1987, Bird and 
collegues (Bird, et al M Ann. Neurol. 2J:25-31, 1988) identified 5 VG kindreds with 
early-onset AD. Subsequently 2 additional early-onset VG families were identified 
(Bird, et aL Ann. Neurol. 25:12-25, 3989). In addition, other families with early-onset 
disease have been identified in which, similarly to the VG kindred, the APP gene and 

15 the chromosome 14 locus have been excluded as harboring the causative gene. These 
families include 4 Swedish families with a mean agc-of-onset ranging from 55 to 64 
and 2 Dutch families with mean onsets of 52 and 59 years. The 7 VG families are 
descendents of inhabitants of 3 adjacent villages from the Volga region of Russia. 
Although a common ancestor could not be traced, the unique ethnic origin coupled with 

20 1 50 years of relative genetic isolation suggests that AD in these families is likely the 
result of a common genetic founder. Inheritance of AD in these kindreds appears to be 
autosomal dominant. Onset of AD in these families ranges from 40 to 81 years with 
family mean age-of-onsets ranging from 51 to 65 years (Bird et al., supra, Table 1). 
Numerous affected subjects in these families have been extensively characterized, both 

25 clinically and neuropathologically, and at least I affected subject from each family has 
had autopsy confirmation of the diagnosis of AD. Except for the relatively early agc- 
of-onset, AD in the VG is clinically and pathologically indistinguishable from typical 
AD. Neurofibrillary tangles, Ap neuritic plaques, and other changes associated with 
AD have been observed in multiple autopsies. 
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Table 1. VG kindreds used for linkage analysis. 



Family 


Affected 


Autopsied 


Affecteds 


Total 


Mean age of onset 




(n) 


(n) 


sampled 


subjects 


+ SD,<n). range 


H 


8 


3 


4(2) 


4 


59.5 + 3.9 (6) 56-68 


HB 


23 


5t 


6(2) 


30 


60.8 + 7.1 (22) 54-75 


HD 


19 


2 


6 


20 


59.6+10.3 (17) 46-82 


KS 


14 


3 


8(1) 


29 


64.8 + 5.4 (13| 55-71 


R 


20 


5 


9(3) 


37 


50.2 + 7.3 (17) 40-67 


W 


4 


2 


2 


4 


59.2+ 10.5 (4) 48-79 


WFL 


6 


2 


2 


15 


63.8 + 7.6 (6) 55-76 


Totals 


94 


22 


37(8) 


139 


59.1 + 9.1 (86) 40-82 



15 * Includes I unaffected subject autopsied. 

t Numbers in parentheses indicates DNA samples obtained from autopsy material, either as paraffin 
blocks or frozen brain. 

The locus responsible for AD in ihe VG kindreds has not been 

20 previously mapped genetically. Linkage analysis, as well as mutational analysis, of the 
VG families with chromosome 21 and chromosome 14 markers (AD3 locus and APOE 
locus) has yielded negative results. Moreover, the locus responsible for AD in the VG 
kindreds does not cosegregate with markers at the APP locus, and no mutations have 
been detected in the APP gene. 

25 Linkage analysis was used to scan the genome for the locus causing AD 

in these VG families. DNA was prepared from lymphoblastoid cell lines from 139 
individuals including 37 affected subjects. When suggestive evidence for linkage was 
found, autopsy-derived tissue, either frozen or embedded in paraffin, was used to 
prepare DNA from 8 additional affected subjects for whom no other tissue was 

30 available. Markers on all chromosomes were genotyped and analyzed using the lod 
score method (Morton, Am. J. Hum. Genet. 7:277, 1955; Hodge, et aI M Am. 1 Hum. 
Genet. 35\\ 139, 1983). For the genome screen, evidence for linkage was evaluated 
assuming autosomal dominant inheritance with age-dependent penetrance and a 0% 
sporadic rate. Lod scores were also computed using a low (1%) penetrance model, 

35 which makes no assumption about the disease status of at-risk individuals and thus 
serves as a check that information about linkage derives primarily from the affected 
individuals (in whom the genotype at the disease locus is more accurately known 
compared to that of at-risk subjects). Published marker allele frequencies (Genome 
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Database) were used unless critical allele frequencies were significantly lower than 
those estimated in the VG. In this case, frequencies obtained from the VG pedigrees 
using all subjects (affecteds, at-risk and spouses) were used. This conservative 
approach was taken during screening since underestimation of the frequency of an 
5 allele co-segregating with the disease can cause false positive evidence for linkage (Ott, 
Am. J. Hum. Genet. 57:283, 1992). However, this approach will underestimate lod 
scores if the true allele frequency is lower than assumed. In this study. 162 publically 
available markers were genotyped. Among these were 70 STRP markers with 
heterozygosity values mostly > 0.70 spaced > 20 cM apart. 

10 This work demonstrates the presence of a previously unmapped AD 

locus on chromosome 1 q3 1 -42. 

Initial suggestive lod scores for chromosome 1 were obtained for 
markers D1S103 (Z roM = 1.79, 9 = .10) and D1S249 <Z mt , - 1.76, 6 = .20) (Table 2), 
which are separated by approximately 26 cM (Fig. 1 ). Subsequently, 21 other markers 

15 in this region spanning 55 cM were analyzed (Table 2). Using the conservative 
screening analysis conditions, significant evidence for linkage (Z^ > 3.0) was obtained 
with D1S479 (Z^ = 4.40, G = .11). This lod score was similar to that predicted by 
earlier power analyses. Most of the evidence for linkage comes from families HB and 
R, with positive lod scores also observed for families W, H, and HD (Table 3). 

20 Subsequently, a 112 bp allele of D1S479 co-segregated with the disease in 5 of 7 
families, which is consistent with a common genetic founder. Other markers in the 
region also gave positive though non-significant lod scores (e.g. Dl S439, Z m2x = 2.82, 0 
= .17; DIS320, Z^ = 1.87,0 = .14; Dl SI 03, Z mK = 2.40, 0 = .008; Table 2). 

Table 2 demonstrates that FAD is linked to chromosome 1 markers. 

25 Genotypes were performed by conventional methods, such as by PCR. AH genotypes 
for D1S479 were determined in duplicate. For markers D1S227, D1S320, D1S213, 
D1S439, D1S479, D1S225, and DIS103, genotypes were determined for 8 samples 
from affected subjects either derived from either paraffin blocks or frozen brain 
material (not all samples amplified with all markers). Lod scores were computed using 

30 the computer program LIPED (Ott, Am. J. Hum. Genet.. 26:588, 1974) assuming age- 
dependent penetrance (Morton, Am. J. Hum. Genet., 7:277, 1955). 
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Table 2. 

Lod scores for linkage of FAD to chromosome 1 markers*, 





nci. 


n Am 
U.UUI 


0.05 


0.10 


0.15 


0.20 


0.30 


0.40 


D1S238 


ft 

U.oO 




-1.99 


-0.34 


0.42 


0.76 


0.81 


0.43 


D1S422 


n 76 


7 75 


-1.70 


-0.28 


0.38 


0.67 


0.65 


0.27 


D1S412 


0 71 


7 °7 


-1.66 


0.18 


1.00 


1.31 


1.17 


0.54 


D1S306 


061 


-14 71 
-I**. M 


7 fC 

-7.75 


-4.69 


-2.88 


-1.71 


-0.43 


0.01 


O1S310 


0.57 


-6 01 


-1.75 


-0.79 


-0.31 


-0.06 


0.14 


0.11 


D1S249 


0 87 


-O 7A 


-1.12 


0.65 


1.44 


1.76 


1.60 


0.81 


D1S245 


0 87 


.IT IC 

-IZ. 1 J 


-2.56 


-0.42 


0.58 


1.05 


1.11 


0.55 


D1S205 


0 80 

w.Ou 


- 1 1 .OO 


-3 J 9 


-1.45 


-0.48 


0.04 


0.38 


0.20 


DIS425 


0 81 


•O.JO 


-2.38 


-1.02 


-0.32 


0.05 


0.27 


0.15 


D1S217 


w.UU 


-1 87 


-0.97 


-0.27 


0.10 


0.30 


0.32 


0.11 


D1S237 


0 76 


-*#.// 


-1.88 


-0.76 


-0.13 


0.20 


0.36 


0.20 


D1S229 


0.77 


-6.54 


-1.42 


-0.34 


0.12 


0 28 


U. 18 


A AO 

-0.08 


D1S227 


0.67 


-2.23 


0.23 


0.80 


0.92 


0.83 


0.34 


-0.07 


D1S320 


0.70 


-8.99 


-123 


0.36 


1.03 


1.27 


1.09 


0.42 


D1S213 


0.86 


-3.80 


-1.57 


-0.80 


-0.37 


-0.13 


0.05 


0.04 


D1S479 


0.80 


-0.05 


3.91 


4.39 


4.25 


3.81 


2.50 


0.99 


D1S439 


0.80 


-5.59 


1.05 


2.36 


2.78 


2.76 


1.97 


0.73 


D1S251 


0.82 


-15.43 


-5.19 


-2.67 


-1.31 


-0.51 


0.14 


0.14 


D1S225 


0.79 


-10.44 


-2.62 


-0.48 


0.60 


1.12 


1.16 


0.52 


D1S103 


0.80 


0.33 


1.60 


1.79 


1.74 


1.57 


105 


0.46 


D1S459 


0.79 


-20.40 


-5.86 


-2.87 


-1.39 


-0.59 


0.02 


0.07 


D1S446 


0.69 


-12.37 


-4.50 


-1.99 


-0.71 


-0.02 


0.46 


0.32 


D1S235 

* ¥ • 


0.68 


-7.54 


-1.09 


027 


0.85 


1.06 1 


0.88 


0.34 



*m. 1 a ., TT T * puousnca aueie trequencies except for D1S479. For 

tfus marker, the allele frequency for the 1 12 bp allele was derived fr 0m the VG subjects 
as a group, while frequences for the other alleles were from published values (17) 
Het, heterozygosity. 
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In Table 3, the family lod score for likage of AD to the marker Dl S479 
is presented. Conditions I and 2 are age-dependent penetrance with VG and CEPII 
allele frequencies respectively; conditions 3 and 4 are age-dcpendcnt penetrance with a 
sporadic correction function using VG and CEPH aJlele frequencies, respectively. For 
5 the H, HB, and R families, maximum lod score were obtained with D1S479. For other 
families, maximum lod values were obtained with other markers (HD; D1S103, 2 max = 
1.75, 9 = 0.001, condition 4), D1S249 (KS; D1S412, Z ndX = 1.20, 9 = .10, condition 2), 
and D1S439/DIS227 (W; D1S227 and D1S439, = 0.78, 9 = .001, condition 4). No 
marker was positive for the WFL family under any condition. Under the low (1%) 
1 0 penetrance model, values were 3.04 (9 - . 1 0) and 1 .27 (9 = . 1 5) for the control and 
VG allele frequencies, respectively. 
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Table 3 

Family lod score for linkage of AD to D1S479. 



Family 


Condition 


0.001 


0.05 


0 10 

V.IV 


0 1 s 


A 7A 
U.ZU 


0.30 


A Aft 

0.40 






0.45 


U.JO 




nil 
U.Z 1 


0.15 


0.06 


0.01 




7 


V.HH 


U.JO 




a *) i 


0.15 


0.06 


0.01 




3 


0.90 


I fl 7K 


A A£ 


A <d 


0.42 


0.21 


0.06 




4 


0.90 


fl IS 


A 

U.OO 




0.42 


0.21 


0.05 


HR 


| 


1.67 


I./5 


1 7 1 




1.33 


0.80 


0.26 




7 
* 


l.UV 




1 Ail 
1.04 




1.27 


0.76 


0.25 






7 IS 


7 47 


2~i i 


n id 


1.93 


1.31 


0.59 




4 


7 11 


7 4A 


7 "ifi 




1.87 


1.27 


0.57 


HD 




-1.84 








-0.08 


-0.05 


-0.02 




2 


-0.32 


-v. 1 J 




A A") 
-U.UZ 


-0.02 


-0.03 


-0.02 




3 


-1.30 


0 07 
v.v # 


n 7R 


n "id 

V.J** 


a n 


0.19 


0.07 




4 


6.22 


0.40 


n^c 

W."f J 


0 44 


A lO 

U.J8 


0.21 


0.07 


KS 


1 


-1.57 


-0.28 


a a* 

•v.UO 


n m 

v.v J 


A AC 
U.Uj 


0.03 


0.00 




2 


-1.40 


-0.26 


-0.05 


0.03 


0.05 


A fll 


A AA 
v.UU 




3 


-1.57 


-0.28 


-0.06 


0.03 


0.05 


0.03 


0.00 




4 


-1.40 1 


-0.26 


-0.05 


0.03 


0.05 


0.03 


0.00 


R 


1 


1.07 


2.32 


2.49 


2.40 


2.19 


i .j j 


A 7A 
v. IKf 




2 


1.34 


2.51 


2.58 


2.46 


2.22 


1.56 


0.71 




3 


1.46 


2.69 


2.84 


2.73 


2.48 


1.76 


0.83 




4 


1,73 


Z88 


2.94 


2.78 


2.52 


1.78 


0.83 


W 


1 


0.67 


0.59 


0.50 


0.42 | 


0.34 


0.18 


0.05 




2 


0.67 


0.59 


0J0 


0.42 


0.34 


0.18 ! 


0.05 




3 


0.67 


0.59 


0.50 


0.42 


0.34 


0.18 


0.05 




4 


0.67 


0.59 


0.50 


0.42 


0.34 


0.18 


0.05 


WFL 


1 


-0.50 


-0.41 


-0.32 


-0.24 


-0.17 


-0.07 


-0.02 




2 


-0.50 


-0.41 


-0.32 


-0.24 


-0.17 


-0.07 


-0.02 




3 


-0.49 


-0.40 


-OJl 


-0.24 


-0.17 


-0.07 


-0.02 




4 


3.49 


-0.40 


-0.31 


•0.24 


-6.17 


-0.07 


-0.02 


Totals 


1 


-0.05 


3.91 


4.39 


4.25 


3.81 


2.50 


0.99 




2 


2.05 


5.92 


6.29 


5.99 


5.37 


3.61 


1.57 




3 


1.83 


437 


4.60 


4.34 


3.85 1 


2.50 


0.99 




4 


3.94 


6.39 ! 


6.49 


609 


5.41 


3.60 


1.56 
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In the VG pedigrees, genotypes are not available from many deceased 
individuals who are related to sampled subjects (e.g. all individuals in generations Mil 
in Fig. 2). Because of this missing data problem, marker allele frequencies can 
influence linkage analysis results. For most of the markers used, allele frequencies in 
5 the VG families were similar to those from controls. However, D1S479, in 5 of the 
families (H, HB, HD, R, and W), was substantially more frequent in the affected 
subjects (0.32 in affected subjects, 0.18 in all VG subjects,) compared to a frequency of 
0.04 in controls. Such an increase in frequency for a closely linked marker allele is 
expected. When control frequencies based on VG unaffected spouses were used for the 

10 1 12 bp D1S479 allele, the maximum lod score for D1S479 increased from = 4.40 
(6 - .11 ) to = 6.29 (8 = .10) (Table 3). 

Linkage analysis may be affected by the presence in the pedigrees of 
phenocopies (non-AD dementia cases or AD caused by something other than the major 
gene segregating in these families). Phenocopies can reduce the power to detect 

15 linkage and increase the estimated 8 (Cavalli-Sforza and King, Am. J. Hum. Genet, 
38.599, 1986). Phenocopies are a potential problem in the VG kindreds since the range 
of onset ages for AD in these families extends up to 82 years of age (Table 1), which 
overlaps with common late-onset AD. To correct for the presence of late-onset AD 
cases in the families, a phenocopy correction model was used (Schellcnberg, et al., Am 

20 J. Hum. Genet. 5J:619, 1993), which assumes that as the age-of-onset of a subject 
increases, the probability that the case is a phenocopy is increased. Despite the fact that 
in the absence of true phenocopies this model reduces the power to detect linkage, for 
all markers analyzed, the lod scores increased (Margaritte.et al., Am. 1 Hum. Genet. 
50:1231, 1992). For D1S479, Z roix values increased from 6.29 to 6.51 and from 4.40 to 

25 4.60 using the control and VG allele frequency for the 112 bp allele, respectively. 
Under this model, the maximum D1S479 lod score for a single family for D1S479 was 
2.95 (9 = .09) for the R family. These results support the hypothesis that at least some 
of the AD cases in these families are probably phenocopies. 

Haplotypes were constructed for the R family using the 23 markers 

30 spanning approximately 55 cM of chromosome 1 (Fig. 7). A common haplotype 
between D1S439/D1S479 and D1S306 was observed in all but I of the affected 
subjects (IV- 1, Fig. 7). This individual did not share the disease haplotype across the 
entire region, had an age-of-onset of 67 years, which is greater than 2 SD above the 
family mean, and had an e4/e4 genotype at the APOE locus. This APOE genotype is 

35 associated with lower age-of-onset for late-onset AD (Corder, et al., Science 267:921, 
1993). Thus, this subject may be a phenocopy. In other families, definite assignment 
of a common disease haplotype was difficult because of missing data, and will require 
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extensive multipoint analysis. However, for DIS479, the 1 12 bp allele segregated with 
the disease in the H, HB, HD, W, and R pedigrees. In the HD family, the 1 affected 
subject who did not have the 1 12 bp allele, had an age-of-onset of 75 years and was e3/ 
e4 at the APOE locus. However, age-of-onset alone could not be used to 
5 unambiguously determine whether a subject is a phenocopy. In the HB family, subjects 
with onsets of up to 75 years shared the D1S479 112 allele with other younger affected 
subjects. The KS family is the only VG kindred in which no affected subject had the 
DIS479 1 12 bp allele. Analysis of this family is complicated because of the late age- 
of-onset (64.8 years) and because of the 8 affected subjects, 3 were APOE t4A;4 

10 homozygotes (onset ages of 57, 58, and 67 years) and 4 were c4 heterozygotes (onset 
ages 67, 68, 68 and 71 years). Thus, the KS family could be an example of genetic 
heterogeneity within the VG kindreds, although group heterogeneity tests did not detect 
heterogeneity. In summary, the finding of the same rare allele segregating with AD in 
most of the VG pedigrees supports the hypothesis of a common genetic founder for 

1 5 most of the families. As described herein, the sequence data from VG patients confirms 
this hypothesis. 

Taken together, these data demonstrate the presence of an AD locus at 
lq31-42. Analysis of recombinants in the R family define the candidate region as 
between D1S225 (subjects VI-3 and V-3 in Fig. 2) and DIS217 (subject IV- 16), a 
20 region spanning 14 cM (Fig. 8), 

EXAMPLE 2 

Clonine of the AD4 gene from chromosome 1 using YACS 

25 YAC clones derived from chromosome 1 were obtained from the CEPH-Genethon 
library and were identified for the region spanning D1S229 to D I SI 03 by 2 different 
methods. 

In the first method, the CEPH-Genethon YAC library was arrayed for 
PCR-based screening. This publically available library (Bellanne-Chantelot, et aL Cell 

30 70, 1059-1068, 1992) was obtained from Glen Evans (Salk Institute) as 21,792 clones 
in 227 96-well plates. For DNA preparation, colonies were grown separately in 1.5 ml 
liquid media, harvested, and pooled. Spheroplasts were made by digestion with 
lyticase. DNA was purified by phenol/chloroform/isoamyl alcohol extractions and 
precipitated by the addition of sodium acetate and ethanol. After resusperision and 

35 RNase treatment, the DNA was ready for PCR amplification. 

A two-dimensional YAC pooling scheme allowed the identification of 
specific YAC addresses by PCR screening (Amemiya, et aL, Nucl. Acids. Res. 
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20:2559-2563, 1992). The library was subdivided inlo five sets (4 x 48 and 1 x 35 
trays). Duplicates were made of each set. One replica of each of the 5 sets was used 
for the first dimension basic horizontal pools (each pool represents all clones from a 
single 96-well plate). The second replica was used for second dimension basic vertical 
5 pools (pools containing clones from a specific well position such as Al for plates from 
a set). The pools within each set were further combined to facilitate PCR screening. 
For each set, horizontal pools were combined to generated to 4 super-horizontal pools. 
Each super-horizontal pool contained clones from 12 consecutive plates (plates 1-12, 
13-24, 25-36, 37-48). The second dimension was combined to produce 8 super-vertical 

10 pools per set. Each super vertical pool contained clones from each row in a set (A, B, 
C,...etc). Hence the entire library was reduced to 19 super horizontal pools (4X4 sets 
+ 3X1 set), and 40 super vertical pools (8X5 sets). In this way, candidate clones can 
be found after two rounds of PCR i.e. a first round with super pools and a second round 
with the basic pools. When multiple hits occur, a final round of PCR is necessary. 

15 The YAC library was screened using primers for the STS D1S320 and 

D1S479 as shown in Figure 9. This screening yielded the following YACs: 920e7, 
859b6, 904O. 

In the second method, a data base was queried. The CEPH-Genelhon 
INFO clone data base contains information concerning the genetic marker and STS 

20 content of YACs in the CEPH-Genethone YAC library. This database was queried 
with the following markers: D1S229, DIS227, D1S213, D1S479, D1S439, DIS225, 
D1S103, D1S495. The YACs identified were 857h3. 934(2, 957gl0, 881b8, 913el0, 
8l0e5, 921dl2, 865c8, 920e7, 859b6, 736b6, 746d7, 748h9, 748e2, 753a5, 753d8, 
753d9, 753f9, 753A2, 753h3, 757d8, 759bl0, 757(9, 76lhl0, 767bl, 767dh 767gl, 

25 769fll, 786a8, 786111, 820g5 ? 824g8, 824g9, 824gll, 826b3, 826b5, 831e4, 849a7, 
856a7, 880dl2, 897f9, 899fl, 899hl, 920gl, 904D, 9I5e8. 922al0, 933a8, 958el0, 
978gl0, and 979f3. Some of these YACs contain the marker used in the query and 
some are connected to YACs which contain the marker used in the database query. A 
total of 60 YACs were identified by the combined methods. 

30 YAC DNA was prepared from each YAC clone. The YAC DNA was 

then amplified using the primers shown in Figure 9. A YAC contig map partially 
covering the region from D1S229 to Dl SI 03 was constructed. 

EST T03796 (Figure 10), was identified in the EST GenBank database 
as a sequence with homology to the chromosome 14 AD gene clones both at the 

35 nucleotide level and the amino acid level. Oligonucleotide primers, EP1F and EP1R, 
and KM7749 and KM7750 were used for PCR amplification (Figure 11). Given the 
EST sequence, the expected size was approximately 140 bp (for primers EPIF and 
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EP1R) or 190 bp (for primers KM7749 and KM7750). When human genomic DNA 
was amplified, a 1,300 bp band was observed. Amplification of the YACs from the 
chromosome 1 Volga German gene region with the same primers yielded a 2.5 kb 
fragment from four different isolates of YAC 921dl2. One isolate of 921dl2 did not 
5 yield this fragment, but it was smaller and presumably deleted for the 2.5 kb fragment. 
YAC 92 1 d 1 2 has been previously shown to contain the markers D1S439 and DIS479. 

In addition, a panel of rodent-human hybrid cell lines obtained from the 
Coriell institute for Medical Research were also amplified with the same primers. 
These cell lines contain one or a few human chromosomes on a rodent background 
10 (either mouse or hamster). Only cells containing human chromosome 1 had the 2.5 kb 
band. 

DNA from YAC 921dl2 was amplified with primers EP1F and EP1R 
and subsequently with WWF and WWR (Figure 11) to produce a 2.5 kb fragment. 
DNA sequence analysis of this 2.5 kb fragment was performed with primers WWF and 

15 WWF1L (Figure 2) to give sequence ELL1.1; (Figure 10) this sequence contained at its 
5' end a portion of the EST T03796. This 2.5 kb fragment was also sequenced with 
primers WWR and WWR1R yielding the sequence ELL1.2, which also contained a part 
of the DNA sequence from T03796. The 2.5 kb band from YAC 921dl2 was 
reamplified, purified and sequenced using the primers: WWF. WWF1L. WWR and 

20 WWR1R. The sequence of these DNA molecules is shown in Figure 10. In this 
fragment, 1 19 bp on the 5* and and 69 bp on the 3' end were identical to the EST 
sequence. In addition, a splice acceptor and donor sequence were observed at both 
junctions of the EST sequence, suggesting that the intervening sequence is an intron of 
the chromosome 1 gene. Thus, EST T03796 is on YAC 921D12, and is a candidate for 

25 containing at least part of the Volga German AD gene. 

cDNAs encoding the AD gene were isolated as follows. One million 
clones from a Werner syndrome fibroblast cDNA library or B616 human brain cDNA 
library were plated and transferred to Duralon membranes (Stratagene). The filters were 
prc-hybridized for one half hour in Church-Gilbert Hybridization solution (0.5M Na 

30 phosphate, pH7.0, 7% (w/v)SDS, 1% (w/v)BSA) at 65° C and screened using 500,000 
cpm/ml of a 140 base pair PCR product amplified from the EP1 primers (140 bp of the 
EST T03796) or a 190 bp PCR product amplified from KM7749 and KM7750 primers. 
Probe was labeled using a random primed method (Bwhringer-Marinheim) for half an 
hour and unincorporated nucleotides were removed by a Microspin s-200 HR column 

35 (Pharmacia). The filters were hybridized for 16 hours and then washed 15 minutes each 
at 65° C in 2X SSC/0.i%SDS followed by a wash in 0.1X SSC/0.1% SDS. 
Autoradiographs were exposed for 16 hours and four primary candidates from the 
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fibroblast library and five primaries from the brain library were isolated. The primary 
candidates were positive upon rescreening. Secondary positives were picked into SM 
buffer and rescued using published protocols (Stratagene). 

In addition, 600,000 clones in an arrayed version (Munroe,et al., Proc. 
5 Nad Acad. Sci USA 22:2209-2213, 1995) of the above fibroblast cDNA library were 
screened using PCR methods. The EPl and WW1 primer pairs were used to screen the 
plate pools (1-48). The EPl primer pair gave positives in plates 1 1, 1 7 and 36. while the 
WW1 primers gave positives in plates 17 and 36. When PCR was done to determine 
column and row addresses only two addresses could be determined: plate 17 C7 (both 

10 primer pairs) and 17 CI 1 (WW1 only). Confirmatory PCR on the individual wells 
showed product for only 17C1 1. Both wells were plated for tertiary screening to obtain 
plaque purified phage. The EPl PCR product was used for screening as above; positive 
plaques were isolated for 17 Cll and were rescued as above. Positive plaques 
eventually purified were designated 11. 15A, 15B, and CI 1, PS2-6, PS2-I0, and PS2- 

15 41. 

The sequence from chromosome 1 was then used to screen several 
cDNA libraries. Positive clones 2.2 kb in length were isolated and sequenced. The 
sequence, shown in Figure 1, shows clear overlap with the ends of the PCR fragment. 
Additional EST sequences with overlapping sequence, and additional cDNA sequences 
20 were obtained by further library screening. 

EXAMPLE 3 
Mutations in Affected VG patients 

25 Using RT-PCR, the entire gene has been amplified from both afTected 

VG individuals and unaffected individuals. The unaffected individuals are from VG 
families and from unrelated lineages. The sequences obtained have been analyzed. In 
affected individuals, there is a nucleotide change in codon 141 (AAC-»ATC) resulting 
in an amino acid alteration from asparagine to isoleucine (N141I). No such change was 

30 found in any unaffected individuals of comparable age. The amino acid sequence in the 
region of the alteration is LLNSVLNTLIMISVIVVMT1 in the unaffected and 
LLNSVLITLIMISVIVVMTI in the affected individuals examined. Affected subjects 
carrying the N141I mutation were identified in 5 (H, HB, HD, R, W) of the 7 VG 
families initially screened. All subjects with the N1411 mutation also carried the 

35 D1S479 1 12 bp allele (allele H y Fig. 7). Subsequently, 2 additional VG families (E and 
BE) were screened for the N1411 mutation. In both kindreds, the affected subjects 
carried the N141I mutation and the D1S479 1 12 bp allele. In total the N1411 mutation 
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was directly observed in 20 affected subjects and inferred to be present from 
spouse/children genotypes in an additional 6 AD subjects; onset ages for the 26 subjects 
ranged from 44 to 75 years. The N141I mutation was also observed in 15 at-risk 
subjects (blood relatives of affected N141I carriers) with ages ranging from 23 to 67 
5 years. No VG spouse (n = 17) had the NI41I mutation. Additional populations 
screened included 84 normal Caucasians, spouses, at-risk subjects and affected subjects - 
from 67 late-onset familial FAD families (n = 223 affected subjects), and 48 affected 
sporadic subjects from an AD clinic population. 

In addition to the N141I mutation, three other variants were identified at 
10 nucleotide 436 (C-VT, subject HB), 496 (C->T, subjects HD and W) and 628 (C->T, 
subjects HD and W). These variants did not change the amino acid encoded at these 
sites (alanine, asparagine, and histidine, respectively). 

EXAMPLE 4 

15 Characterization of the AD4 Gene and its Protein Product 

The sequence of the VG from chromosome 1 was constructed from 
sequence of several clones as described. The sequence is shown in Figure 1. The 
amino acid sequence deduced from this DNA sequence is shown in Figure 2. 

20 Comparison of this sequence with the sequence of the AD3 gene from chromosome 1 4 
shows sequence homology. An alignment of the proteins and genes as constructed 
using the program GAP from the Wisconsin package of sequence analysis software 
(Ver. 8.0.1 - UNIX, September 1994) is shown in Figure 3. The amino acid sequence 
of AD4 exhibits approximately 67% identity to the AD3 gene product. Some notable 

25 characteristics of the AD4 gene product are shown in Figure 5. 

The protein encoded by the chromosome 1 AD gene is a membrane 
protein, as judged by the seven hydophobic segments presumed to be membrane 
spanning regions (Figure 4). The AD4 protein is also referred to as STM2. For STM2, 
all the putative transmembrane helices are capped at the carboxyl-terminal end by a 

30 lysine residue, found either at the very end or within a few residues of the end of the 
transmembrane domains. The N141I mutation, a change from a hydrophilic asparagine 
residue to a hydrophobic isoleucine residue, is at a position directly adjacent to the first 
predicted transmembrane domain. 

A northern blot (Clontech) containing 2 ng of human poly A-plus 

35 MRNA derived from heart brain placenta, lung, liver, skeletal muscle, and pancreas 
was hybridized with the 2.3 kb AD4 cDNA clone as a probe. Hybridization was 
performed in 0.75M NaCl, 0.05M NaH 2 P04, 5mM Na 2 EDTA, 10X Denhardt's 
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solution, 100 microgram/ml salmon sperm DNA, 50% formamide, 2% SDS at 42°C for 
18 hours. Filters were washed with 0.1XSSC, 0.1% SDS at 65°C. Northern blot 
analysis of RNA from a variety of human tissues revealed two AD4-related transcripts 
which were approximately 2.4 and 2.8 kb in size. The larger transcript appears to be 
5 primarily expressed in placenta, skeletal muscle and heart, while the smaller transcript 
is detected in heart, brain, placenta, liver, skeletal muscle and kidney but not lung. A 
sequence-tagged site (STS) from the 3' untranslated region was used to confirm that 
AD4 maps to chromosome 1 and to YAC 921dl2. The two transcripts observed arc 
possibly the result of alternative polyadenylation since two putative polyadenylation 
10 signals were identified in the 3' untranslated region, at nucleotides 1834 and 2309. The 
message sizes for STM2 from Northern blot analysis (2.4 and 2.8kb) are slightly larger 
than the 2.3 kb size predicted from the full-length DNA sequence and the 1.8 kb size 
predicted by the second polyadenylation site observed in the sequence. This apparent 
discrepancy may be due to the anomalous migration of size markers which has been 
1 5 previously observed with this type of Northern blot. 

The normal cellular function of STM2 i$ unknown. If this protein is 
functionally similar to spe4, the C elegans gene with sequence similarity to STM2, it 
may play a role in the cytoplasmic partitioning of proteins. Mutations in STM2 could 
alter intracellular protein trafficking of APP and ultimately lead to altered APP 
20 processing and increased production of Apl-40 or A01-42 (amyloid p protein). 
Consistent with this notion, recent work with fibroblast cell lines from AD3 carriers has 
shown that secretion of Ap ending at position 42 is increased relative to non-carrier cell 
lines. Alternatively, STM2 may function as a G-protein coupled receptor or as an ion 
channel. 



25 



EXAMPLE 5 
Cloning o f AD gene on chromosome 1 



The gene may be obtained from genomic or cDNA libraries or from 
30 genomic DNA or mRNA. The nucleotide sequence of the gene provided herein may be 
used to design oligonuclotides. The primers are optimally 18-25 nucleotides in length 
and preferably contain not more than 50% AT base pairs, do not end in A. and should 
not have self-complementary regions. 

The oligonucleotides may be used as hybridization probes on cDNA or 
35 genomic libraries. For cDNA libraries, mRNA from a specific source, such as a cell 
line or tissue sample, is transcribed into cDNA and inserted into an appropriate vector 
XgtlO, XZAP, pBS, or pSPORT). (see Sambrook et aL Molecular Cloning, Cold 
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Spring Harbor, 1989). The recombinant vector is introduced into an appropriate 
bacterial strain (e.g., DH5a or KW257) and the cells plated on growth medium. 
Alternatively, cDNA libraries representing RNA from various sources are obtained 
commercially (e.g., Clontech Laboratories). In either case, colonies (for plasmids) or 
5 plaques (for phage) arc transferred to a nitrocellulose or nylon membrane and prepared 
for hybridization (Sambrook, supra). Oligonucleotide probes are labeled at their 5' end 
with 3 2 P using T4 polynucleotide kinase. Hybridization and washes are performed 
under standard conditions for the length and GC content of the probe (see Sambrook et 
ah, supra). 

10 For isolation of genomic clones containing the AD4 gene, genomic 

libraries are constructed by standard methods (Sambrook et al., supra) or obtained from 
commercial sources (Clontech, Stratagene, Research Genetics and Genome Systems). 
Libraries are constructed in phage, cosmids, PL BACs, or YACs and probed as 
described for cDNA libraries. Colonies or phages that hybridize are picked, purified 

1 5 and grown for preparation of DNA. 

If a full-length sequence is not obtained by the cloning and screening 
methods described, full-length sequences may be obtained by screening additional 
clones, by RACE (Frohman and Martin, Proc. Natl. Acad. ScL USA #5:8998-9002, 
1988), ligation-anchored PGR (Troutt et al., Proc. Natl Acad. Set. USA #9:9823-9825, 

20 1992) or other similar methods. Verification that a clone contains AD4 sequence may- 
be obtained by determining the DNA sequence of the clone and comparing it to the 
sequence of AD4 provided herein. Sequence identity of 90% or better indicates that the 
clone encodes AIM. Some mismatch may be due to polymorphisms or DNA sequence 
errors. 



25 



EXAMPLE 6 
Detection of mutations 



Mutations in the AD4 gene may be determined by nucleotide sequence 
30 analysis. Peripheral blood cells are obtained from patients by venipuncture and 
hypotonic lysis of erythrocytes. DNA or mRNA is isolated from these cells and the 
AD4 gene isolated by amplification. Oligonucleotide primers are chosen based on the 
cDNA sequence provided herein or genomic sequence. A computer program such as 
Primer 2.2, which is publically available from the web-server at the Whitehead/MIT 
35 Center for Genome Research, assists in choosing oligonucleotide primers. If mRNA is 
used as a source of nucleic acids, first strand cDNA if synthesized by standard methods. 
Amplification of DNA is performed and the DNA sequence of the PCR product is 
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determined. A comparison of the sequence obtained from the patient and from a 
normal control allows identification of mutations. 

In addition, the N141I mutation in AD4 gene creates a Sau3A I 
restriction site, which may he used to rapidly screen for mutant alleles. Genomic DNA 
5 or cDNA may be directly screened for the SaulA I restriction site or amplified by PCR 
using primers which flank the 141 codon. DNA or amplified DNA is restricted with 
Sau3A I and electrophoresed in an appropriate percentage agarose gel. The presence of 
the Saul A I site is detected by ethidium bromide fluorescence for amplified products or 
by probe hybridization following transfer of the gel to nylon membrane. Suitable 

10 probes are oligonucleotides or DNA fragments containing sequence near or 
encompassing the 141 codon. Preferably, in the case of an oligonucleotide probe, there 
is at least 24 bases that do not span codon 141 . 

As shown in Figure 12, the N14U mutation is detectable after 
amplification. Approximately 60-100 ng of genomic DNA was amplified with primers 

15 WWF and INTIR (intronic primer 5-GTGGGGCAGACGGAGAGAA-3') at 40 ng 
each, 1 .5mM MgCL and 200 uMdNTPs. Amplification was performed with an initial 
incubation at 95° C for 3 min. followed by 35 cycles of 45 sec. at 94° C, 45 sec. at 
60° C, and 45 sec. at 72° C, with a final incubation for 1 0 min. at 72°. The 1 60 bp PCR 
product (in 20 ul) was digested with 10 U Saul A and the fragments resolved by 

20 electrophoresis in a non-denaturing 12% polyacrylamidc gel. Fragments were 
visualized by ethidium bromide staining. The N 141 1 mutation results in fragments of 
102 bp, 58 bp, 43 bp and 15 bp (not shown) while non-carriers have only the 102 and 
58 bp fragments. Lanes arc: I, at-risk, 79 years; 2, affected, onset 60 years; 3, at-risk, 
67 years of age; 4, affected, onset 75 years (presumed phenocopy); 5, Boehringer 

25 Mannheim DNA size standard V; 6, affected, onset at 52 years; 7, affected, onset 46 
years; 8. affected, onset 49 years. 



EXAMPLE 7 

Determination of the DNA Sequence of the AD4 genomic rep ion 

30 

A 2.5 kb fragment of DNA from the AD4 gene was amplified from YAC 
921dl2 DNA using primers WWF and WWR and radioactively labeled by nick- 
translation (Sambrook, et al. supra). A human genomic clone hybridizing to this 
fragment was identified in a commercially available library (Genome Systems PI 
35 library). The hybridization was carried out according to the manufacturers instructions. 
PCR amplification using primers specific for the 5* and 3' ends of the AD4 cDNA 
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confirmed that the PI clone (clone #6004 in the nomenclature of Genome Systems) 
carried the entire AD4 gene. 

DNA from PI clone 6004 was sheared and subcloned into M13mpl8 
vector prepared by digestion with Smal and de phosphorylation. The vector was 
5 purchased from Novagen (Catalog # 69996-1 ) and the fragments were cloned according 
to the manufacturers instructions. Approximately 1600 clones were isolated. The 
sequence of the insert was determined by fluorescent detection on an ABI373A 
according to the manufacturers instructions. The data was assembled using the 
program Xgap (Staden, R. (1994) The Staden package, hi "Methods in molecular 
10 biology" A.M. Griffin and H.G. Griffin, Eds., Vol 25, pp. 9-170, Humana Press). 
Exons were identified by entering the cDNA sequence into the assembly. 

The sequences of the 12 exons and the flanking intron sequence of the 
AD4 gene are shown in Figures 13 to 19. The relative positions of the exons in these 
sequences are listed in Table 4. 

15 

Tabic 4 

Position of exons in the genomic sequence 





Exon 


Genomic 


Exon 


Exon 


20 




sequence 


position 


leneth 




exonl 


stm.genA 


546 


143 




exon2 


stm.genB 


850 


186 




exon3 


stm.genC 


327 


161 




exon4 


stm.genC 


2140 


215 


25 


exon5 


stm.genD 


168 


142 




exon6 


stm.genE 


808 


68 




exon7 


stm.genE 


1555 


221 




exon8 


stm.genF 


102 


99 




exon9 


stm.genF 


1345 


84 


30 


exon 10 


stm.genF 


1809 


102 




exonl 1 


stm.genG 


1053 


117 




exon I 2 


stnugenG 


2470 


683 
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EXAMPLE 8 

Isolation of the gene fro m patient DNA and detection of polymorphisms. 

Treatment of Alzheimer's disease is more effective if begun prior to the 
5 onset of neural degeneration. However, current diagnostics only effectively identify 
patients after substantial loss of cognitive function. Basically, loss of cognitive 
function in an older patient without any detectable cause is the primary indicator of 
Alzheimer's disease. Strict confirmation of the diagnosis is made by autopsy. Any 
treatment which blocks the initiation or progression of Alzheimer's disease will be 

1 0 much more effective if an earlier diagnosis is made. 

One dominant mutation of the AD4 gene is known to cause early-onset 
Alzheimer's disease. Other mutations in this gene or in other members of the same 
gene family may cause or contribute to Alzheimer's disease. We provide herein 
methods for detection of mutations in the AD4 gene. The same techniques will allow 

1 5 detection of mutations in other members of the gene family. 

Ten pairs of oligonucleotide primers used to PCR amplify ten fragments 
carrying the 12 AD4 exons are listed in Table 5. DNA is isolated from patient blood 
samples and individual PCR reactions carried out with each set of primers using the 
Pcrkin-Elmer PCR kit according to the manufacturers instructions in a Perkin-Elmer 

20 9600 PCR machine with the following cycle conditions: 2 minutes at 92°C, 35 cycles 
of 20 seconds at 92°C, 45 seconds at the indicated annealing temperature, and 3 
minutes at 72°C, followed by 7 minutes at 72°C. PCR fragments are purified by gel 
filtration in a 96-wetl format spin column with Sephracyl-500HR resin (Wang, Can 
Boysen and Hood, Anal. Biochem. 226: 85-90, 1995). Mutations are detected by 

25 fluorescent dye terminator sequencing on the ABI373A using the sequencing primers 
specific for each exon as listed in Table 6. Many other mutation detection formats are 
possible using the PCR amplified fragments. 

A diagnostic test will have several uses in addition to the eventual use of 
targeting patients for early intervention with Alzheimer's drugs. First, clinical trials of 

30 any Alzheimer's drug would benefit from monitoring the allelic variations in genes 
which predispose towards the disease. The genetic background of the patients may be 
correlated with different mechanisms for initiation of the disease: a drug which is 
effective against one disease mechanism may not be effective against disease caused by 
other mechainsms. Second, the diagnostics may be used to choose the appropriate 

35 treatments if clinical trials show that different genetic backgrounds are correlated with 
the efficacy of drugs. 
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TableS 

Primers for PCR of cxon fragments 



Exons 
amplified 8 


Printer 


Sequence (5'-r) 


PCR product 
si2e*> 


PCR conditions 
MgCl2(mW) 


Annealing 
temperature (°C) 


1 


INTI05L 
IKT106R 


GGCGTTTTGTTCTTCTTCTCTC 
ACCAAAGGGTACAGAGTTTTCCC 


956 


1.0 


60 


2 


INT108L 
INT107R 


CATCCCCAAGTQGTATGTGTTA 
CTCCCCA6CCGTGCTTGTG 


610 


1.0 


60 


3 


IKTIUL 
INTII2R 


CGTTTGATTGACAGGCATCTCT 
CAAGGTGGCAGAGTGGACAG 


549 


>0 


61 


4 


INT1I5L 
INTU6R 


CATATGCCCTAGTAGCTCATAG 
CCATTATACGAACAAGGAAGCTG 


596 


0.75 


56 


5 


WTII3L 
IKTII4R 


GCTCATGGGGATGGTGCCC 
GAAGCCGCTGTCTAAAAGCTTAT 


1149 


1.0 


60 


6.7 


INT102L 
INT10IR 


CCCTCCCAGCGTAGGCATGAA 
GCAGAAAGGGATGCAAGCCC 


1200 


2.0 


62 


8 


INT103L 
IKTI09R 


GGCATGCTCTGAGAGCTCCA 
GCCCAGAAGCTCCTTGCTCC 


572 


1.0 


60 


9,10 


IKT110L 
INTI04R 


TGGACCCCTCCCACAACGG 
TCACAGAGATGCCTCTGATGG 


868 


1.0 


60 


11 


INTI17L 
INT118R 


ACTGGTTTTACCTGAGTrTCAGA 
CGAGCACACCTCCTGTGAG 


568 


1.0 


58 


12.3UTR 


INT7L 
DMO 1009 


GGTGTCTAGCACCGTTATCC 
ACAGGACTCATCTATTTATGGATA 


803 


2.0 


60 



3NSDOC1D: <WO 97031 B2A3_IA> 



SUBSTITUTE SHEET (RULE 26) 



WO 97/03192 PCT/US96/11386 

54 



Table 6 

Primers for sequencing exons from PCR templates 



5 



Exon 


Primer seauence 


Primer Position 


Exon 1 


AACACCGTTCATACCTT 


808 


Exon 2 


CAATTGTCTGGGTATCA 


800 


Exon 3 


CTTTGTCTCACAGGAAA 


256 


Exon 4 


CGTGCATTACATGGATA 


2062 


Exon 5 


GTC CAG AATCACTCAAG 


79 


Exon 6 


AT CAGTCTCAGGATCCT 


719 


Exon 7 


GCTGACTGTCTGGAACT 


1584 


Exon 8 


CCAGTCAACTCTGAAAG 


288 


Exon 9 


CTTGCTCAATCTTCAAG 


1582 


Exon 10 


CCTGGTAACACTCTGAC 


1708 


Exon 1 1 


TCCACATCTTAGCTTCTAG 


912 


Exon 12,5* 


CTAGACCATGACTCACAG 


2397 


Exon 12, 3 ^ 


AGAGGGGTCAACATTTAC 


3179 



20 

EXAMPLE 9 
Expression ofSTM2 

25 The expression pattern of STM2 was examined in multiple tissues by 

Northern blot analysis (Fig. 20). A probe was generated by amplifying the STM2 
cDNA clone using primers ADC22R2 and ADC22L5. Identical patterns were obtained 
with probes made using primer pairs DMO 101 8 and DMO 1021 or DMO 1009 and 
DMO1014. A 2.4 kb message corresponding to STM2 was observed in all tissues 

30 examined, including all regions of brain sampled. (In kidney, liver and lung, longer 
exposures of the same Northern blot showed STM2 to be present at low levels in these 
tissues; data not shown). High expression levels were observed in the pancreas, heart, 
and skeletal muscle. In these three tissues, a second 2.8 kb transcript was observed. 
From the STM2 cDNA sequence, two potential polyadenylation signals were identified 

35 at nucleotides 1834 and 2309 (total clone length, 2167 bp). To determine whether the 
2.4 and 2.8 kb transcripts could represent different length 3' UTR polyadenylation 
variants, RNA- blots were hybridized with a probe scanning the entire 3' UTR and with 



BNSDOCID: <WO 9703192A3JA* 




WO 97/03192 PCT7US96/1 1386 

55 

a probe starting after the first polyadenylation site. Both probes gave identical patterns, 
indicating that only the second polyadenylation site was used. 

In an attempt to identify homologues of STM2. liver and pancreas 
cDNA was amplified with degenerate primers based on the STM2 sequence. Twcnty- 
5 two MI 3 recombinant clones were identified and sequenced, all corresponding to 
STM2. Seven of the 22 clones had the sequence GAGATGGAAGAC beginning at 
nucleotide 1327 in the cDNA sequence, which is 3 bp shorter than the previously 
reported sequence (AGATGG|AAGAAGAC, where T indicates the junction between 
exons 9 and 10). The shorter transcript encodes a predicted protein one amino acid 

10 shorter (EMED) versus EMEED) with no frameshift. The shorter transcript is likely a 
result of alternative splicing of exon 10, with the first coding AG dinucleotide used as 
the splice acceptor sequence or a polymorphism. To distinguish between these 
possibilities, RNA and DNA were prepared from a lymphoblastoid cell line from a 
normal (non-AD) VG subject in the HD family. Both transcripts were present in RT- 

15 PCR products using primers EX103L and EX102R from exons 9 and 10. However, 
when the intron 9 to exon 10 region of genomic DNA from the same subject was 
sequenced, a single sequence that corresponds to the genomic sequence was obtained. 
Thus the two different transcripts arise from alternative splicing. Both transcripts were 
also observed in leukocytes, skeletal muscle, and one fetal brain library (Clontech), 

20 with the longer transcript being more abundant. However, in another fetal brain library 
(Stratagene), only the longer transcript was observed, while in a foreskin fibroblast 
library, only the shorter transcript was found. 

Table 7 

« 

25 Primers for RNA analysis 





Primer 


Location 


Primers (5-3') 


PCR produci 


Conditions 


Annealing 










size (kb> 


MgCh 


temperature (°C) 


1 


EXI03L 


Exon 9 


CTGCCATGGTGTG6ACG6TT 


170/173 


1.5 


60 




EX102R 


Exon 10 


CTCCAGCTCCTCCCCTGGG 








2 


ADC22R2 


3*UTR 


GGACAAAGCATTGGGAACACT 


408 


1.5 


56 




ADC22L5 


nrm 


TGAGGGACATGGTGTGCCACAGG 








3 


DMO1014 


3'UTR 


GAACTGAGAAGGTCAGATTA 


268 


1.5 


56 




DMO1009 


3'UTR 


ACAGGACTCATCTAnTATCCATA 








4 


DMO1018 


3*UTR 


ATTGGATGCAGTTGTATAGT 


229 


1.5 


56 




DMO1021 


3'UTR 


TAATCTGACCTTCTCAGTTC 
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From the foregoing, il will be appreciated that, although specific 
embodiments of the invention have been described herein for purposes of illustration, 
various modifications may be made without deviating from the spirit and scope of the 
5 invention. Accordingly, the invention is not limited except as by the appended claims. 
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Claims 



We claim: 



1 . An isolated nucleic acid molecule encoding an AD4 gene product. 



2. The isolated nucleic acid molecule of claim 1 wherein said molecule is 
set forth in Figure I . 

3. The isolated nucleic acid molecule of claim 1 wherein said nucleic 
acid molecule encodes a mutant ACM gene product that increases the probability of 
Alzheimer's Disease. 



4. The isolated nucleic acid molecule of claim 3 wherein said nucleic 
acid molecule encodes a mutant AIM gene product with an amino acid subsitution at residue 
141. 



5. The isolated nucleic acid molecule of claim 4 wherein said substitution 
at residue 141 is an aspargine to isoleucine substitution. 

6. An isolated nucleic acid molecule selected from the group consisting 

of: 

(a) an isolated nucleic acid molecule as set forth in Figure I or 
complementary sequence thereof; 

(b) an isolated nucleic acid molecule that specifically hybridizes to 
the nucleic acid molecule of (a) under conditions of high stringency; and 

(c) an isolated nucleic acid that encodes an AD4 gene product. 

.7. An expression vector, comprising a promoter operably linked to a 
nucleic acid molecule according to any one of claims 1 to 6. 

8. The expression vector according to claim 7 wherein said promoter is 
selected from the group consisting of CMV I-E promoter, SV40 early promoter and MuLV 
LTR. 

9. The expression vector according to claim 7 wherein said promoter is a 
tissue-specific promoter. 



BNSDOC1D. <WO 9703192A3JA> 



WO 97/OJ 192 PCT/US96/1 1386 

58 



10. A viral vector capable of directing the expression of a nucleic acid 
molecule according to any one of claims 1 to 6. 



11. The viral vector according to claim 10 wherein said vector is selected 
from the group consisting of herpes simplex viral vectors, adenoviral vectors, adenovirus- 
associated viral vectors and retroviral vectors. 



12. A host cell carrying a vector according to any one of claims 7 to 1 1 . 

13. The host cell according to claim 12 wherein said cell is selected from 
the group consisting of human cell, dog cell, monkey cell, rat cell and mouse cell 

1 4. An isolated protein comprising an AD4 gene product. 

15. The isolated protein of claim 14 wherein said protein consists 
essentially of an AD4 gene product. 

16. The isolated protein of claim 14 wherein said protein has an amino 
acid sequence set forth in Figure 2. 

17. The isolated protein of claim 14 wherein said protein is a mutant AD4 
gene product that increases the probability of Alzheimer's Disease. 

18. The isolated protein of claim 17 wherein said mutant AD4 gene 
product has an amino acid subsilution at residue 141 . 



19. The isolated protein of claim 17 wherein said substitution at residue 
1 4 1 is an aspargine to isoleucine substitution. 

20. A method of treating or preventing Alzheimer's Disease, comprising 
administering to a patient a vector containing or expressing a nucleic acid molecule 
according to any one of claims 1, 2, or 6, thereby reducing the likelihood or delaying the 
onset of Alzheimer's Disease in said patient. 

21. A method of treating or preventing Alzheimer's Disease, comprising 
administering to a patient a protein according to any one of claims 14 to 23, thereby reducing 
the likelihood or delaying the onset of Alzheimer's Disease in said patient. 
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22. A method of treating or preventing Alzheimer's Disease, comprising 
administering to a patient an antibody specific for a protein according to any one of claims 1 7 
to 1 9, thereby reducing the likelihood or delaying the onset of Alzheimer's Disease in said 
patient. 

23. The method according to any one of claims 20-22 wherein said method 
comprises in vivo administration. 

24. A pharmaceutical composition comprising a vector comprising the 
nucleic acid molecule according to any one of claims 1 and 2, along with a pharmaceutical ly 
acceptable carrier or diluent. 

25. A pharmaceutical composition comprising a host cell according to any 
one of claims 12 and 1 3, along with a pharmaceutical ly acceptable carrier or diluent. 

26. A pharmaceutical composition comprising a protein according to any 
one of claims 14 to 16, along with a pharmaceutically acceptable carrier or diluent. 

27. An antibody which specifically binds to the protein according to any 
one of claims 14 to 19. 

28. The antibody according to claim 27 wherein the antibody is a 
monoclonal antibody. 

29. The antibody according to claim 28 wherein the antibody is selected 
from the group consisting of an Fab fragment, an Fv fragment and a single chain antibody. 

30. A hybridoma capable of producing the antibody according to claim 28. 

31. A pharmaceutical composition comprising an antibody according to 
claim 27, along with a pharmaceuucally acceptable carrier or diluent. 

32. A nucleic acid probe which is capable of specifically hybridizing to an 
AD4 gene under conditions of high stringency. 
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33. A nucleic acid probe comprising at least a portion of the nucleotide 
sequence shown in Figure 1 or its complementary sequence, said probe being capable of 
specifically hybridizing to a mutant AD4 gene under conditions of high stringency. 

34. A probe comprising at least a portion of the nucleotide sequence 
shown in Figure 1 or its complementary sequence, said probe being capable of specifically 
hybridizing to a mutant AD4 gene, in which amino acid residue 141 is changed from 
asparagine to isoleucine, under conditions of very high stringency. 

35. The probe according to any one of claims 32 to 34 wherein said probe 
comprises at least 1 2 nucleotide bases. 

36. A pair of primers capable of specifically amplifying all or a portion of 
a nucleic acid molecule according to any one claims 1 to 6. 

37. The primer according to claim 36 wherein said primer comprises at 
least 12 nucleotide bases. 

38. A method of diagnosing a patient having an increased likelihood of 
contracting Alzheimer's Disease, comprising: 

(a) obtaining from a patient a biological sample containing nucleic acid; 

(b) incubating the nucleic acid with a probe which is capable of 
specifically hybridizing to a mutant AD4 gene under conditions and for time sufficient to ' 
allow hybridization to occur; and 

(c) detecting the presence of hybridized probe, and thereby determining 
that said patient has an increased likelihood of contracting Alzheimer's Disease. 

39. A method of diagnosing a patient having an increased likelihood of 
contracting Alzheimer's Disease, comprising: 

(a) obtaining from a patient a biological sample containing nucleic acid; 

(b) amplifying a selected nucleic acid sequence associated with a mutant 

AD4 gene; and 

(c) detecting the presence of an amplified nucleic acid sequence, and 
thereby determining that said patient has an increased likelihood of contracting Alzheimer's 
Disease. 
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40. A method of diagnosing a patient having an increased likelihood of 
contracting Alzheimer's Disease, comprising: 

(a) contacting a biological sample obtained from a patient with an 
antibody that specifically binds to a mutant AD4 protein, under conditions and for a time 
sufficient ro allow binding of the antibody to the protein; and 

(b) detecting the presence of the bound antibody. 

41. A peptide vaccine comprising a portion of a mutant AD4 gene product 
containing a mutation, in combination with a pharmaceutical ly acceptable carrier or diluent. 

42. A transgenic animal whose germ cells and somatic cells contain an 
AD4 gene which is operably linked to a promoter effective for the expression of said gene, 
said gene being introduced into said animal, or an ancestor of said animal, at an embryonic 
stage. 

43. The transgenic animal according to claim 42 wherein the animal is 
selected from the group consisting of a mouse, a rat and a dog. 

44. The transgenic animal according to claim 42 wherein AD4 is 
expressed from a vector according to any one of claims 7 to 1 1 . 

45. The transgenic animal according to claim 42 wherein the AD4 gene 
encodes a mutant AD4 gene product. 

46. An isolated DNA molecule comprising AD4 gene sequence presented 
in Figures 13 to 19. 
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FIGURE 1A 

The coding region for AD4 begins a start codon beginning at 
nucleotide 368 gctlatgjctc and ends with a stop codon beginning 
at nucleotide 1712 atc(tga)ggg 

1 cgagcggcgg cggagcaggc atttccagca gtgaggagac agccagaagc 

51 aagcttttgg agctgaagga acctgagaca gaagctagtc ccccctctga 

101 attttactga tgaagaaact gaggccacag agctaaagtg acttttccca 

151 aggtcgccca gcgaggacgt gggacttctc agacgtcagg agagtgatgt 

201 gagggagctg tgtgaccata gaaagtgacg tgttaaaaac cagcgctgcc 

251 ctctttgaaa gccagggagc atcattcatt tagcctgctg agaagaagaa 

301 accaagtgtc cgggattcag acctctctgc ggccccaagt gttcgtggtg 

351 cttccagagg cagggctatg ctcacattca tggcctctga cagcgaggaa 

401 gaagtgtgtg atgagcggac gtccctaatg tcggccgaga gccccacgcc 

451 gcgctcctgc caggagggca ggcagggccc agaggatgga gagaacactg 

501 — cagtggag aagccaggag aacgaggagg acggtgagga ggaccctgac 

551 cgctatgtct gtagtggggt tcccgggcgg ccgccaggcc tggaggaaga 

601 gctgaccctc aaatacggag cgaagcacgt gatcatgctg tttgtgcctg 

651 tcactctgtg catgatcgtg gtggtagcca ccateaagtc tgtgcgcttc 

701 tacacagaga agaatggaca gctcatctac acgccattca ctgaggacac 

751 accctcggtg ggccagcgcc tcctcaactc cgtgctgaac accctcatca 

801 tgatcagcgt catcgtggtt atgaccatct tctcggtggt gctctacaag 

851 taccgctgct acaagttcat ccatggctgg ttgatcatgt cttcactgat 

901 gctgctgttc ctcttcacct atatctacct tggggaagtg ctcaagacct 

951 acaatgtggc catggactac cccaccctct tgctgactgt ctggaacttc 

1001 ggggcagtgg gcatggtgtg catccactgg aagggccctc tggtgctgca 

1051 gcaggcctac ctcatcatga ccagtgcgct catggcccta gtgttcatca 

1101 agtacctccc agagtggtcc gcgtgggtca tcctgggcgc catctctgtg 

1151 tatgatctcg tggctgtgct gtgtcccaaa gggcctctga gaatgctggt 

1201 agaaactgcc caggagagaa atgagcccat attccctgcc ccgatatact 

12S1 catctgccat ggtgtggacg gttggcatgg c^aagctgga cccctcctct 

1301 cagggtgccc tccagctccc ctacgacccg gagatggaag aagactccta 

1351 tgacagtttt ggggagcctc cataccccga agtctttgag cctcccttga 

1401 ctggc-accc aggggaggag ctggaggaag aggaggaaag gggcgcgaag 
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FIGURE 1B 



1451 cttggectcg gggaettcat cttctacagt gtgctggtgg gcaaggcggc 
ISO! tgccaegggc agcggggact ggaataccac gctggcctge ttcgtggcca 

1551 tcctcattgg cttgtgtctg accctcctgc tgcttgctgt gttcaagaag 

1601 gcgctgcccg ccctccccat ctccaecacg ttcgggctca tcttttactt 

1651 ctccacggae aacctggcgc ggccgttcat ggacaccctg gcctcccatc 

1701 agccccaeac ctgagggaca tggtgtgcca caggccgcaa gctgcaggga 

17S1 attttcattg gatgcagttg tatagtttta cactctagtg ccatatattt 

1801 ttutKttc tetttcctta aaaaataaag tacgtgttta cttggtgagg 

1851 aggaggcaga accagctctt tggtgccagc tgtttcatca ccagactttg 

1901 gctcccgctt tggggagcgc ctcgcttcac ggacaggaag eacagcaggt 

1951 ttatccagat gaactgagaa ggtcagacta gggcggggag aagagcatcc 

2001 ggcacgaggg ctgagatgcg caaagagtgt gctcgggagc ggcccccggc 

2051 acctgggtgc tctggctgga gaggaaaagc cagttcceta cgaggagtgt 

2101 ccccaatgct ttgtceacga tgtccttgtt attttattgc ttttagaaac 

2151 tgagtcctgt tcttgttacg gcagtcacac tgctgggaag tggcttaata 

2201 gtaatatcaa taaatagatg agtcctgtta gaaaaa 
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FIGURE 2 



Protein sequer.ee of 448 amino acids chac re D re S «, t5 -k 

of che Human AD4 gene. represents the gene prccuc: 



1 KLTFMASDSE EEVCDERTSL MSAESPTPRS CQEGRGGPED CENTAQWRSQ 

51 ENESDGEEDP DRYVCSGVPG RPPGLEEELT LKYGAKHVIM tFVPVTLCMI 

101 WVATIKSVR FYTEKNGQLI YTPFTEDTPS VGQRIXNSVL WLIM1SVTV 

151 VMTIFLWLY KYRCYXFIHG WLIMSSLMLL FLFTYIYLGE VUCTVWAMD 

201 YPTLLLTVWN FGAVGMVCIH WKGPLVLQQA YLIMISALMA LVf IKYLPEW 

2S1 SAWVILGAIS VYDLVAVLC? KGPLRMLVET AQERNEPIFP AXIYSSAMVW 

301 TVGMAKLDPS SQGAtQLPTO PEKEEDSTOS FGEPSYPEVF EPPLTGYPGE 

3S1 ELEEEEERGV KUSLGDFIFY SVLVGKAAAT GSGOWNTTIA CFVAILXGLC 

401 LTLLLLAVFK KALPALPISI TFGLIFYFST DNLVRPFMDT IASHQLYI 
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FIGURE 3 



Sequence alignment: of the Chromosome 1 AD4 gene produce with the 
Chromosome 14 A03 gene product. 

AD4 1 MLTFKASDSEEEVCDERTSLMSAESPTPRSCQEGRQGPEDGDITACWRSQ 50 

I " : I - I : I • | j. II .:.|: I - II 

AD3 1 MTELPAPLSYFQ. - MAQMSZDNHLSNTVHSQNDNRERQEH^. . . . .RRSL 44 

AD4 51 QJEEDGSEDPDRrVCSGVPGRPPGLEffiLTLKYGAKHVIMLFVPVTLCMI 100 

::.|. :..|...... : | | | | 1 1 | 1 1 | | | | I I I I I I I I I I • 

45 ^^Laica^sR^^ 94 



AD3 
AD4 
AD3 



AD4 101 Vyy^TIKSVRFrrE^ 150 

- 55 wuayiM^^ 144 

145 ^^^^^^^KIRC^KVTiAWLlISSLLlJ^TTSTIYlJ^VFi^C^LvD 194 

201 YPTIXLT7W^y^p W KGPLviQQAYI.IMIsA^ 2S0 
,« IJ: I HIIJ^Iil^rMinM llllllllllllllllllllilll 
195 YITVALL1WNFGWCMSIHWKGPU^QAY1IMISAIJ«I.VFI^PEW 244 

AD4 251 SA^^^LyA^CPKGPLH^VETAQEHNEPIFPALITSSAMVW 300 

AD4 301 TVGilAKLDPSSQ. . GALQLP YDPEMEE . . . .DSYBSFGEPSYPEVFEPPL 344 

I - 1 1 • 1 I • • | . . . . | : : | . | I . . 
AD3 29S LVNMAEGDPEAQRRVSKNSFTNAESTEJlESQDTVAEimi^GFSEEWEAQR 344 

AD4 345 TGYPG EEL EXEEEJIGVKLGLGDFIFYSVLVG 375 

AD3 345 DSItt*GPHRSTPESRAAVQELSSSIIA^ 394 

AD4 376 1^«SGD1^LAC™^ c2 5 

ad3 ^iili^JI;!' 11 1,111,1 mm mih mil iii i iih 

AD4 426 FYFSTDNLVRPFMDTLASHQLYI 44 8 

c m-N HUM II ||:|| 

AD3 445 FYFATDYLVQPFMDQLAFHQFYI 467 
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FIGURE 5 

Sequence features of the Human AD4 gene produce 



1 MLTFMASDSE EEVCDERT5L MSAESPTPRS CQEGRQGPED GENTAQWRSQ 

51 ENEEDGEEDP DRYVCSGVPG RPPGLEEELT LKYGAKHVIM LFVPVTLCMI 

101 WVATIKSVR FVTEKNGQLI YTPFTEDTPS VGQRIXNSVL NTLXMISVIV 

151 VMTIFLWLY fCYRCYXFIHG WLIMSSLKLL FLFTYIYLGE VLKTYNVAMD 

201 YPTLLLTVWN FGAVGMVCIH WKGPLVLQQA YLUHSALMA LVFIKYLPEW 

Conserved tetramer (KGPL 222-225) 

251 SAWVILGAIS VYDLVAVLCP KGPLRMLVET AQERNEPIFP ALIYSSAMVW 

Conserved tetramer (KGPL 271-274) 

301 TVGMAKLDPS SQGALQLPYD PSMESDSYDS FGEPSYPEVF EPPLTGYPGE 

351 ELEEEEERGV KLGLGDFIFY SVLVGKAAAT GSGDWNTTLA CFVAILIGLC 

Putative (Glycosylat ion Site (N 386) 
*-N-i inked 

401 LTLIXLAVFK KALPALPISI TFGLIFYFST ONLVRPFMDT LASHQLYI 
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DOTPLOT COMPARISON OF THE PRODUCTS OF HUMAN A04 AND AD3 GENES 
HUMAN AD4 GENE PRODUCT (1 TO 448) 




Fig. 6 
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FIGURE 9 



D1S103 
D1S2Z3 
dis225 
D1S227 
D1S22S 
D1S32C 
D1S439 
dl3453 
D1S47S 



ACC3AAC31TTCTACAAGTT31C 

CaiTATCCAAGGTCAGGaGG 
AGCTGTTAATCCAATCTAXGArGTG 

GCCTGG3TGAC3UUVGai 
AGCTOTGTCGCKnrrPGA 

GaGcsArrrcRGaATGTTGc 
ccnrrcn^cArTarGGTG 

ACrCTACTTGTGTGTGAATGTATa 

TTTTGGCXCtZAGTAATATCTTG 
TrGCAGACTCTGaACTXAGAACA 

Caa^ rrTC^TrRGAGGGG 
<*TTtSAAATGGTGaATTTGGA 

. GGGGACC^TwTTTCTGC^lC 

AcccArrscdccarc 

GCK3GAGarrTGGACTGG 
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FIGURE 10 

DNA sequences 

788IIH5.SEQ 

1 AGTGAAATCG TCCTGTGACC ACGCGTCAAG TGCTGATGGG GCACAGCAAC 
51 TTCCGGGCCT ATCATATCTC CTTGACCTCG TCCCTCAAAT CTGGTAGTTT 
10! CTGCACCGAG GGACACAGTC CACTGCGATG AAGTATGTTC AAAATCG^FT 
151 TCTATAGGCA GGTCCTTCCA AAGTCCAATA GTGCAAGGTG GACGm 

ESTT03796 

1 GAGAAGGTAA GAGCTGACCC TCAAATACGG AGCGANACGT NATCATGCTG 
51 TTTGTGCCTG TCACTCTGTG CATGATCGTG GTGGTAGCCA CCATCAAGTC 
101 TGTGCGCTTC TACACAGAGA AGAATGGACA GCTCATCTAC ACG CCATTCA 
151 CTGAGGACAC ACCCTCGGTG GGCCAGCGCC TCCTCAACTC CGTGCTGAAC 
201 ACCCTCATCA TGATCAGCGT CATC3TGGTT ATGACCATCT TCTTGGTGGT 
251 GCTCTACAAG TACCGCTGCT ACAAGTTCAT CCATGGCTGG TTGATCATGT 
301 CTTCACTGAT GCTGCTGTTC CTCTTCACCT ATATCTACCT TGGGGAAGTG 
351 CTCAAGACCT ACAATGTGGC CATGGACTAC CCCACCCTCT TGCTGACTGT 
401 CTGGGAACTT CGGGGCAGTG GGCATGGTGT GCATCCACTG AAGGGGCCTT 
451 TGGTGCTGCA GAGGCCTACC TATCA 



SAGGCCCTCMCCTGCCCTCCAGCCACCCTTCTCTCCGTC 



ELLL1 DNA sequence 

GGTGGNGCTCTACAAGTACCGCTGCTACAAGGTGA 

ELL1.2 DNA Sequence 



A CTTTG 



TGCTGATATACTGTATGTTGTCTATATATCATCTAACACCCTCACACTGGAACACAATGCCCGTGGGCAGAG 

^^^=== 

CCAGCGTAGGCATGAAGTAGCCTAATGAAGAGCATTCAGGCTTGGGTATCAGTCTCAGG 
TAGAATTTCn^CGCTTGG^ACACC^ 
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FIGURE 11 



PCR PRIMERS 



WWF 


GGA CAC ACC CTC GfGT GGG CC 


WWR 


CAA GGT AGA TAT AGG TGA 


EP1F 


CTT GGT GGT GCT CTA CAA GTA C 


EF1R 


GTA GTC CAT GGC CAC ATT GT 


WWFlL 


TCC TCA ACT CCG TGC TGA AC 


WWR1R 


AGC ATC ACT GAA GAC ATG ATC 


1.3F 


ACC ACC GGG GCA ACC TGA 


1.3R 


AGG AAG CAG ATC TCA ATA GGA 


EP2F 


GAG AAG GTA AGA GCT GAC CC 


EP2R 


TGA TAG GTA GGC CTC TGC AGC 


921D12F 


TCC AGC CAC CCT TCT CTC C 


921D12R 


CAG TAT TAG GTC ATC CAC GGT 


INT1R 


GTG GGG CAG ACG GAG AGA A 


ADC22L1 


CCC AAG TGT TCG TGG TGC TT 


ADC22R1 


ACT GTG GAC ATA GCG GTC AGG 


KM7749 


GGT AGA TAT AGG TGA AGA GG 


KM77S0 


TCA CTG AGG ACA CAC CCT CG 
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FIG. 12 
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FIGURE 13 



Genomic sequence of stm.genA 



tttccagcagtgaggagacagccagaagcaagcttttggagctlLglaacctgagalaq 
9cttctagatctgctgtccatccctccgag^ 

a ^ gagaa "5 gagagcattccata 9 a 9 ac g a ttgtagactaacttaactcagaagcgac 
agcctggggttgccaaggctgtctacgaagtaacttgattaggaccgaccccigcSIca 
gtaaggaagcctctgatgcctctgtagccaattctgcagacacctglgcctccaaggcc? 

aa= a l 9 ^^ C ^ tCCtgaatgtggttgtgggca 9 cc 9g t gactggcgIIgItIcI^ 
oal?^! 9 ^ tCttggc " cacctca 5 a 9 ct 9 c 9tcctcacgacgcccacgtIgagcS a 

atgtatcaggtttttggggaaaactctgtaccctttggtgttgaagtcccatglglca 



.970319243 JA> 
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FIGURE 14 



Genomic sequence of stm.genB 



cattcatttagcctgctaaaaaa aaaaa ^2r?rZ?fff_f"5? aaa 9 cc agggagcat 



:aattcag 



:caacctgcttt:tgaaccct< 
-gggagtttgctgtggcattl 
-taaggccaaagcctttacat 
•gggtttccataatgattgce 
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FIGURE 15 



caccacagatgtttgctgattgaatgaatgagcacactgacagtttggagctgccctgac 

tttcgtggctatgcgttttgccccctgggatgtgagtcacctcaggccagccccaggcaa 

ggccgctgctgcctccatggtaactctcaaggcctcttgttttatggcagtcgtttgatt 

gacaggcatctcttggaagcttttggggcaggacttgtgtccaagtctccaggtcgcctc 

cagccaccccctgagtcctccactgcctttgtctcacaggaaagtggaacaaggtccttg 

tgctcctttttccaggtgcttccagaggcagggctatgctcacattcatggcctctgaca 

gcgaggaagaagtgtgtgatgagcggacgtccctaatgtcggccgagagccccacgccgc 

gctcctgccaggagggcaggcagggcccagaggatggagagaanactgcccagtgggtag 

gtcccaccagcagctgggggccttcaaacaggtcctgcggctactgtaccttacagatga 

aaaccagacattcattccctgatgcgggagggagaagggaagtaatgatgaggattggcc 

gaaaaggtgggtggctggccatgatggaccttccatctgcagggtttcataggactgcgc 

attcacagccagagatggacttggcagtgggctgaagacgctgtccactctgccaccttg 

ggtttacctctctcatgcaggtcactgtttccactgtaataggagagtttgtttggatgc 

ctgggtgctaggacaggtaacacagaagcttaggatggtagcaggggaagcattttttgg 

cagatggccagacatggtaagtgtgagaggagtctgcctgatacacgattgacttttgag 

ctggggatatttgggcttcactgtgatcattcagcccccaggggaggagattgtaacgtt 

agaaagagtaggatatcgttgggagagecacttagttgtgtcctttctctcccgatcagg 

gcagaacatctgaatttgcctgaaccctgttctctgttttgcccattatagaattaaaaa 

atgtctctgtgtggactgtttttttgcagccagtcttaatcctgcttgctgaaatttgag 

ctcacttctccatgttctccttgagaacggaaccatcgtccctaagccctgagtgaaatc 

acaccagcttaaggccactgctctgccactcctcagccttttcttgtttgttatctccgg 

gaagttttgtacactttggttgtttcagtttctgttcatgagtagtcttctttcttggct 

gaacgtctagattgggactctctctgcagagaaccggtactgaagcaactgtcattttca 

gtttttgtttcatttggctttttctttagctgttcaccttattagcaaggcagcccatga 

ccttgacttgccacagttccaaaacacaaattcttacagatcggtttgtgctagtgtctg 

gcaggtgtcctgccctccctcgttacctcctcatttgtgcctgcccaccttcccagagcc 

tgcgtcttctcagatgcttaacacctgtttagcctctctagttcagagctacaaatttac 

atgcttgattctgtggggcagaaagttcaaagtaatttcttcctctgcaaattcccagta 

tcttagtcacacgcaaagagagtgtccctgtgcactgactcctctagctagtgatttgtc 

agccaaaaatgtttatttatctcctggcctgtctcctcccatatcagtatggccacatga 

acagaattgagtgacctcctgagtccctgtattaggaaggggaaagatcttttgattcat 

taaccattaagttgattcattaaccattaagtcttgggcctgcagaccatagcaaccttc 

cttccttcatttacggtgcttcatccagctccaaatcttctctactttgtcctcacaaac 

t«tcatatgccctagtagcrcatagactgctccttat:atctggaaagcaacattcaaac 

ttctcatttctggttccaaaaatccgtgcattacatggataggctgccntgggggacatt 

cngcggccctcacgatgtggtttcccacagagaagccaggagaacgaggaggacggtgag 

gaggaccctgaccgctatgtctgtagtggggttcccgggcggccgccaggcctggaggaa 

gagctgaccctcaaatacggagcgaagcatgtgatcatgctgtttgtgcctgtcactctg 

tgcatgatcgtggtggtagccaccatcaagtctgtgcgcttctacacagagaagaatgga 

cagctgtgagttggggggctggggggagcagggtggggtgagggctgagttgccaggggg 

tggggggcgcagcagcctgtgttggtcactgtacctgcagctccacaccagcagcggtaa 

agagcagggatgaagaaccgcccaggttcatggcctggctcactgcctcctggattgtga 

cctacttgggcatgcttttaacatccctatgcctcagcttccttgttcgtataatgggtt 

gataacgcagttactgggagaattaagtgagttaatatgagtgaagggcttagaagagtg 

tctactgcacgtgagtgctcaggcaagctggatcctgctgcagaaagcaagctcttgatc 

ctgggcatggctgtgccactgatccctgtgtgactgcaaacaaatcacttcctctctgag 

tctctgcttccctgaatgtgaaacaaggtggttggaccagatatttctcagctcacttcc 

agccttgtgaggaagacttataaagcctttcgtttattttagtaaaatacatgcagaggc 

agcagcgtagaaaaatgagaagcttcctceacttcttccccctcccctttctgtggtcct 

cactgctaagcaccttctgtaaacttttttttttttttttaaagttaggganttttgttt 

catttcgtgtgtgttggttttttttgttgttgttgtttcttttaaagaaaggaataaggc 

caggtgtggtgtctcatgcctgtaatcccagcactttgggagactgaggtgagaggatta 

tttgagcccaggagtttgagaccagcctgggaaatgtggcgagaccctgtctgtacaaaa 

aatgcaaaaattagccaggtgtggtggtacatgcctgtagtctcagctacttgggagact 

gaggtggaagaacacctgagcccagaagtcgaggctgcagtgagccatgattgcgccact 

gcactgcagcctcagcaacagagtgagaccctgtctcaaaatttttttaa 
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FIGURE 16 

Genomic sequence of stm.genD 
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FIGURE 17 

Genomic sequence of stm.genE 

=9™"=tctgtgtatggtag,^g,cageaaggI4g?fglw^ 
^^""^""Mtgctgeac.aggaggg^ 

^^^^^'^'''"""''^•a'^cctlJtgllaaacLagglg 

5™ ! 2 """"^^atggagtcttgetctgtcgcceaggetggagtata 
gtgg<^ccatcttggstgaetgcaacctecgccteccggg?tcLgtg!t?c?Ieta?« 
"?f=tectgagtagataggattat*g,qatgtgecaI2Iegctg^ 

tttt.,t»gag.tg,ggtttc.ccat^tg,2c.,,ctg»tS3IIS"Sa=Hw 
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FIGURE 18 

Genomic sequence of stm.genF 




ctgtgatcccqcaqccac 

=»9W9~ct==ag 9 t cc=I?, 9 ,i? a ° 99 ^ 
SfflSSSSaSSSSSSS 
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FIGURE 19 

Genomic sequence of stm.genG 

cagsgtctcgctctgttgcceaggctggagtgcagragcacgattgtggctcactgcaaccttgacttcctQooc 

tcaagtgarcctcccaccttggcctcctgagtagctgggartacaggcacgtgctaecacaectggctrcatttat 

atttttagtagagacaaggttttgacttgttgcecgggctggtttcgaactcctgggctcaagtgatceacctge 

cttggccttccaaagtactgggattataggtctgagccacagcacctggccaaaattagttttattataagaqat 

gcaactcaggaccagccaaatgaagagacagtgaagaagtaatgctgatggatcacacetggtgggggaaogcgg 

acagctggggccaggagcaggagggacacctgcagggctggaagggcaggggaggtgggcccccatggtttgtgt 

ttattgcataaccatttttattgtctacagtgagcaaagttatectataaacaagtgtcagggaccattgcacta 

aagaaaacaaatgagagcattttggaagetctaatttcctgateagtaatgggtagactaattcccagtitatatt 

taeetgttgtaaggtgaaaggttcttcagaggacctctgtettggCgttatatgggettttgaatgtaetQaaat 

taaattccetaaaaanctgtgattcagacttcatactaaattgtacagcagtgcccagcceaaggccttgcattt 

c t atttgttgt:tttcttt«ctctctaagtgccc«ac.ctgg t tt:t:aectg.gtttcag..c t gcccgct«tctc 

tgcccaggttgtaagtcacccagtccacaggtgtcccctgctttcecactggccactgatttggggaggeagcto 

tccatgtccccagtccacatcttagcttctagaggccaggtggggtgggctgggctgggcaagagcagltgggcc 

ttctgggccagagtrtctcttctttttccattctgtgcacgcctcttcagtacgggttacrgtctctLtlalac 

agggggcgtgaagettggcctcggggacttcatcttctacagtgtgctggtgggeMggcggctgccaegggcag 

eggggactggaataccacgctggcctgcttcgtggccatcctcattgtgagtggctggggatgcgtccantgcc 

tegtggtgggggeccecagggtcctcattgtggtgggggcaggcctcaggatccctagggatttttcatttcttc 

tcttccctctgagggacaagageagggagcggggctggaagggtcagcttgagaccaaggctcaeaggaggtgtg 

ctcgcccctaggtgggctccagcctgtggaggacagtgcaggggagggtgaggagtgtaccggcccHgcgtLc 

tgagcacacagcctccaggccgaggacccagetgacagctttgcgcagtgatgataecetcgaggtggttgcgat 

g«catcagat«ge.gaa.ag.»aattgct:raagggcct t gcccat:gggegcaaag«ag t gaggaccal:gtttt 

ccccctcctccacgccattgggacaccacagggtctgaatctggggcactaggggtggccecgitactgtgaacc 

acagcagtgaaatgtggaggccctgtagtcagttaacgtggccagatacacataatggggagacgtcctgccgtg 

acttcatctcagagatttcgctgtcacgttagaggaggaggagcgtctgagccgtgcgcttggcatctgcccnt 

agtgaaaaecctgggcatggcatgattaaggttgaegctccagtgtccagaaggttttctttttgcccacaagta 

tatcagggatgggatggtggacccaggctcctccaecaccagactgcettacctgagccctgctggccecaaaga 

tatagaaggcaccctggtteectgtgctcacctggaccactgectgcatcagctgggtcaggggaggatgggcag 

eccccacacctgcttcceaggggcaggttgcctggcggcrctgattcccttggtgccagclg^glgaacntac 

tgccatttcagttgagcccacctagctetcatataaatacatgttccctgagggcatcttaccatcccatgtgac 

cactccagccagacaggggaggcagcacggcctcggggcacagcaccgctccaggagtcaggaggcctgccttct 

ggteeacteactaacaggtgaggtgaeetaatgggggtgagaacttctgcccttaacacctcaagagctgttgca 

ggaceagggaagataatggggtgtctagcgccgttatccgaccggtcctcgaacaagctcctgtgcccagggact 

agaccatgactcacagctcctgtccacaccagggatcaccacgctcaccctcccctccacgtcctgcagggcttg 

tgtetgaccctcctgctgcttgctgtgttcaagaaggcgctgcccgccctccccatctccatcacgttcgggctc 

atettttacctctecacggacaacctggtgcggccgttcatggacaccctggcctcccatcagctctacatctga 

gggacatggcgtgccacaggctgcaagcegcagggaactttcattggatgcagttgtatagttttacactctagt 

!!""^ tttttaa9aCt " tCtttCett *" a " ta "9 tac 9 t 9tttacttggtgaggaggaggcagaace« 

gctctttggtgccagctgtttcatcaccagactttggetcccgctttggggagcgcctcgcttcacggacaggaa 

gcacagcaggtttatccagatgaaetgagaaggtcagattagggeggggagaagagcatccggcatgagggctga 

gatgcgcaaagagtgtgctcgggagtggcccctggcacctgggtgcrtctggctggagaggaaaagccagttecct 

acgaggagtgtteccaatgctttgtccatgatgtccttgttattttattgcctttagaaactgagtcctgttctt 

gttacggcagtcacactgctgggaagtggcttaatagtaatatcaataaatagatgagtcetgttagaatcttgg 

•gtttggtccgttgtaaatgttgacccctctccctgcatcttgggcacccctgggataacttgtgctgtgagccc 

•ggatggaggcagtttgccctgtttgaaggaacttttaatgatctegcctctctgcacacatttctttaactaga 

aagtttcctaagcaaaggagttaggagagcagggtggcctgacatctgccagccctgagctgtaaggetgtggat 

gctgagcaggtccctggactcagttgtgcacggtggcacagacactgccaggtggttgccaaaacatccagtggt 

tcc-tcagcaagtgttcaccctctgcagaagcctgtgagggcctgagctcagaaaccactctccttrccttctct 

ggctttggccctgggcactgtggtgggagagtggacagtttggctttgccttctctgtacatcaatcatgggttg 

caaagagaatetcagaagtgcetcttcctgagcacagtggctcacacctgtaatcccaatacttcgggaggtcga 

gtcggaaggatcacttgagcccaggagtttgagaccagccggggcaacatagtgagactttgtacaaaaaaaaat 

ttaaaaattagccaagcatggtggcatgcatctgtagtctcagctactctggaggctgaggtgggaggatcactt 

aagcccaggaggctgaggctgcaatgagccgagatcaagcaggtgttaggtatatcagacagctgagaagacgca 
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